LOCAL ENTROPY AVERAGES AND PROJECTIONS OF FRACTAL MEASURES

Abstract. We show that for families of measures on Euclidean space which satisfy
an ergodic-theoretic form of "self-similarity" under the operation of re-scaling, the 
dimension of linear images of the measure behaves in a semi-continuous way. We 
apply this to prove the following conjecture of Furstenberg: if $X, Y \subseteq [0, 1]$ are
closed and invariant, respectively, under $\times m \bmod 1$ and $\times n \bmod 1$, where $m, n$ are
not powers of the same integer, then, for any $t \neq 0$,

$\dim(X + tY) = \min\{1, \dim X + \dim Y\}$.

A similar result holds for invariant measures, and gives a simple proof of the Rudolph- 
Johnson theorem. Our methods also apply to many other classes of conformal frac- 
tals and measures. As another application, we extend and unify results of Peres, 
Shmerkin and Nazarov, and of Moreira, concerning projections of products of self-
similar measures and Gibbs measures on regular Cantor sets. We show that under
natural irreducibility assumptions on the maps in the IFS, the image measure has
the maximal possible dimension under any linear projection other than the coordi- 
nate projections. We also present applications to Bernoulli convolutions and to the 
images of fractal measures under differentiable maps. 



1. Introduction 

1.1. Background and history. Let $\dim$ denote the Hausdorff dimension of a set
and let $\Pi_{d,k}$ denote the space of orthogonal projections from $\mathbb{R}^d$ to $k$-dimensional
subspaces, with the natural measure. Then it is a classical fact, due in various versions
to Marstrand, Mattila and others, that for any Borel set $X \subseteq \mathbb{R}^d$, almost every $\pi \in \Pi_{d,k}$
satisfies

(1.1) $\dim(\pi X) = \min(k, \dim X)$.

Indeed, the right hand side is a trivial upper bound: Lipschitz maps cannot increase
dimension, so $\dim \pi X \le \dim X$, and $\pi X$ is a subset of a $k$-dimensional subspace,
hence $\dim \pi X \le k$. Since this equality holds almost everywhere, we shall use the term
exceptional for projections for which equality fails, and call the right hand side the
expected dimension.

While (1.1) tells us what happens for a typical projection, it is far more difficult
to analyze the image of even the simplest fractals under individual projections. See
Kenyon pj| for a particularly simple and frustrating example.

There are, however, a number of well-known conjectures to the effect that, for certain 
sets of combinatorial, arithmetic or dynamical origin, phenomena which hold typically 
in the general setting should, for these special sets, always hold, except in the pres- 
ence of some evident obstruction. The present work was motivated by a conjecture of 
this kind concerning projections of product sets whose marginals are invariant under 
arithmetically "independent" dynamics. 

Denote the $m$-fold map of the unit interval by $T_m : x \mapsto mx \bmod 1$ and let $\pi_x, \pi_y \in
\Pi_{2,1}$ denote the coordinate projections onto the axes.

Conjecture 1.1 (Furstenberg). Let $X, Y \subseteq [0,1]$ be closed sets which are invariant
under $T_2$ and $T_3$, respectively. Then

$\dim \pi(X \times Y) = \min\{1, \dim(X \times Y)\}$

for any $\pi \in \Pi_{2,1} \setminus \{\pi_x, \pi_y\}$.

In the situation above it is evident that $\pi_x, \pi_y$ are exceptions, since they map $X \times Y$
to $X$ or $Y$, respectively, and a drop in dimension is to be expected.

Note that this conjecture can also be formulated as a result on sumsets:^1 for $X, Y$
as above and all $s \neq 0$,

$\dim(X + sY) = \min\{1, \dim X + \dim Y\}$.

Here $A + B = \{a + b : a \in A,\ b \in B\}$.

Conjecture 1.1 originates in the late 1960s. Although it has apparently not appeared
in print, it is related to another conjecture of Furstenberg's from around the same time,
which appears in [12]:

Conjecture 1.2 (Furstenberg, [12]). Let $X, Y \subseteq [0,1]$ be closed sets which are invariant
under $T_2$ and $T_3$, respectively. Then for any $s, t$ with $t \neq 0$,

$\dim X \cap (s + tY) \le \max\{\dim X + \dim Y - 1, 0\}$.

The relation between these conjectures is as follows. The sets $X \cap (s + tY)$ are, up to
affine coordinate change, the intersections of $X \times Y$ with the fibers of the projections
$\pi \in \Pi_{2,1} \setminus \{\pi_x, \pi_y\}$. Heuristically, one expects the following relation between the
dimension of the image and the fibers:

(1.2) $\dim(X \times Y) = \dim \pi(X \times Y) + \sup_z \dim\big((X \times Y) \cap \pi^{-1}(z)\big)$.



^1 In the sumset formulation we relied on the identity

$\dim(X \times Y) = \dim(X) + \dim(Y)$.

This holds in the present case because $X$ has coinciding Hausdorff and box dimension (see e.g. [T^
Theorem 5.1]); in general one only has the inequality $\dim(X \times Y) \ge \dim(X) + \dim(Y)$.

This is, for example, the way affine subspaces in $\mathbb{R}^d$ behave under linear maps, as
do generic sub-manifolds, and if it were true then Conjectures 1.1 and 1.2 would be
equivalent: if the image under a projection has the expected dimension then the fibers
would behave as expected as well.

Equation (1.2) is a very strong statement, and simple examples show that it is not
generally true. For quite general sets $A \subseteq \mathbb{R}^d$ it is known that, under a natural distri-
bution on the $k$-dimensional subspaces which intersect $A$, almost every such subspace
intersects $A$ in the expected dimension, i.e. the larger of $\dim A - k$ and 0; see [22,
Theorem 10.11]. For certain special sets $A \subseteq \mathbb{R}^2$, related in some ways to the product
sets we are discussing, a stronger result was obtained by Furstenberg [13]: for every
$t$ there are many (in the sense of dimension) values of $s$ such that $A$ intersects the
line $x - ty = s$ in at least the expected dimension. However, the uniform upper bounds
needed for Conjecture 1.2 still seem out of reach of current methods.

We refer the reader to [12] for a more detailed discussion of Conjecture 1.2 and some
related questions.

1.2. Iterated Function Systems. A related circle of questions concerns projections 
of product sets whose marginals are attractors of iterated function systems (IFSs) on 
the line. Here again it is believed that, in the absence of some evident "resonance" 
between the IFSs, projections should behave as expected. 

There was little progress on these problems until fairly recently. The first result of 
this kind is a theorem by C. G. Moreira [23]^2 for pairs of regular IFSs, i.e. systems of
$C^{1+\varepsilon}$ contractions on the line satisfying the strong separation condition (see Section
11). Moreira assumes that at least one of the IFSs is strictly non-linear, i.e. cannot
be conjugated to a linear one, and that a certain irrationality condition is satisfied
between the IFSs. Under these hypotheses he shows that if $X, Y$ are the attractors
then the sumset $X + Y$, which is the projection of $X \times Y$ under $\pi(x, y) = x + y$, has
the expected dimension. 

More recently Y. Peres and P. Shmerkin [29] solved the problem for projections
of $X \times Y$ when $X, Y$ are attractors of linear IFSs satisfying an irrationality condition,
namely, that the logarithms of some pair of contraction ratios are rationally independent.
This class of examples includes some special cases of Furstenberg's conjecture. For
example, the standard middle-third Cantor set is both $T_3$-invariant and the attractor
of an IFS with contraction ratio 1/3. With an eye to Furstenberg's conjecture, these
methods can be pushed to apply to $T_m$-invariant subsets of [0, 1] which are shifts of
finite type with respect to the base-$m$ coding. See also [10] for an extension to some
non-conformal attractors on the plane.

With regard to the question of projecting measures rather than sets, Nazarov, Peres
and Shmerkin [27] recently established some results for projections of Hausdorff measure
on $X \times Y$, where $X, Y$ are now attractors of linear IFSs in which, additionally, all
contracting maps have the same contraction ratio (while these assumptions are quite
special, in this case they also establish a stronger result using correlation dimension
rather than Hausdorff dimension).

^2 The proof in [23] is incomplete, see e.g. [29].

It is interesting to note that the methods of Moreira and of Peres- Shmerkin are quite 
different and rely heavily on their respective assumptions, i.e. strict non-linearity and 
linearity of the IFSs. This leaves open the case of a pair of IFSs which are both non- 
linearly conjugated to linear IFSs. Their methods also do not give any information 
about behavior of measures on regular IFSs. 

Finally, similar questions may be asked about multidimensional attractors of IFSs
rather than products of one-dimensional ones. For the case of 2-dimensional linear IFSs
Peres and Shmerkin [29] showed that, assuming that the orthogonal parts of the con-
tractions include at least one irrational rotation, all projections behave as expected.
Unfortunately these methods do not work for dimension $d \ge 3$, and again give no
information about measures.

1.3. Results. In this work we develop a method for bounding from below the dimen- 
sion of projections of measures which exhibit certain statistical self- similarity. Before 
describing the general result, we summarize our main applications. 

The first is a resolution of Conjecture 1.1 in its full generality. In fact, we establish a
stronger statement concerning invariant measures. Recall that for a probability measure
$\mu$ on a metric space, the lower Hausdorff dimension $\dim_* \mu$ is defined as

$\dim_* \mu = \inf\{\dim(A) : \mu(A) > 0\}$.

Also, write $\dim \mu = \alpha$ to indicate that

$\lim_{r \downarrow 0} \frac{\log \mu(B_r(x))}{\log r} = \alpha$ for $\mu$-a.e. $x$.

In this case $\alpha$ is the exact dimension of $\mu$ and $\dim_* \mu = \dim \mu$, but note that $\dim \mu$
is not always defined. See Section 3 for a discussion of dimension.

Theorem 1.3. Let $\mu, \nu$ be Borel probability measures on [0,1] which are invariant
under $T_m, T_n$, respectively, where $m, n$ are not powers of the same integer. Then for
every $\pi \in \Pi_{2,1} \setminus \{\pi_x, \pi_y\}$,

$\dim_* \pi(\mu \times \nu) = \min\{1, \dim_*(\mu \times \nu)\}$.

If $\mu, \nu$ are exact dimensional then the above holds for $\dim$ instead of $\dim_*$.

Note that both the conjecture above and the theorem hold trivially in dimension
zero. From the theorem one proves the conjecture using the variational principle to
relate the dimension of sets and measures; see Section 10.4.
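
To make the object in Theorem 1.3 concrete, the following Python sketch pushes a product of two finitely supported measures forward under $(x, y) \mapsto x + ty$, which agrees with a non-coordinate projection up to an affine change of coordinates. The dictionary representation (atom to mass) and the name `project_product` are assumptions made for illustration only; the measures in the theorem are of course not finitely supported.

```python
from collections import defaultdict
from fractions import Fraction


def project_product(mu, nu, t):
    """Push the product measure mu x nu forward under (x, y) -> x + t*y.

    mu and nu are dicts mapping atoms to masses (hypothetical representation).
    """
    image = defaultdict(Fraction)  # Fraction() is zero
    for x, wx in mu.items():
        for y, wy in nu.items():
            image[x + t * y] += wx * wy
    return dict(image)


# Example: two fair coins on {0, 1}.
coin = {Fraction(0): Fraction(1, 2), Fraction(1): Fraction(1, 2)}
overlapping = project_product(coin, coin, 1)  # atoms collide at 1
separated = project_product(coin, coin, 2)    # four distinct atoms
```

With $t = 1$ two atoms of the product collide, while with $t = 2$ they stay apart; this is a toy analogue of the difference between exceptional and non-exceptional directions.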

Theorem 1.3 also leads to a very short proof of the Rudolph-Johnson theorem (see
Section 10.5): If $m, n$ are not powers of the same integer, $\mu$ is a probability measure on
[0, 1] invariant under both $T_m$ and $T_n$, and all ergodic components have positive entropy
for one (equivalently both) of the maps, then $\mu$ is Lebesgue measure. Unfortunately,
neither this proof nor our methods provide a hint on how to approach the long-standing
conjecture about the entropy zero case.

For the next result we require some notation. For general definitions regarding
IFSs see Sections 9, 11. Given a contracting smooth map $f$ on [0,1], we let $\lambda(f) =
-\log(f'(x))$, where $x$ is the fixed point of $f$. Furthermore, if $\mathcal{I} = \{f_i : i \in \Lambda\}$ is a
regular IFS (see Section 11 for the definition), we let

$L(\mathcal{I}) = \{\lambda(f_{x_1} \circ \dots \circ f_{x_n}) : n \in \mathbb{N},\ x_1, \dots, x_n \in \Lambda\}$.

Theorem 1.4. Let $\mathcal{I}_i = \{f^{(i)}_j : j \in \Lambda_i\}$, $i = 1, \dots, d$, be regular IFSs with attractors
$X_i$. Suppose the following holds:

Minimality assumption. The set $L(\mathcal{I}_1) \times \dots \times L(\mathcal{I}_d)$ is dense in the quotient
space $(\mathbb{R}^d, +)/\Delta$, where $\Delta$ is the diagonal subgroup of $\mathbb{R}^d$.

Then for any globally supported Gibbs measures $\mu_i$ on $X_i$ corresponding to arbitrary
Hölder potentials, and for any projection $\pi(x) = \sum_i t_i x_i$ with all $t_i$ nonzero,

$\dim \pi(\mu_1 \times \dots \times \mu_d) = \min(1, \dim(\mu_1) + \dots + \dim(\mu_d))$.

A classical result of R. Bowen shows that Hausdorff measure on a regular Cantor set 
is equivalent to a Gibbs measure for an appropriate potential. We therefore have 

Corollary 1.5. If $X_i$ are attractors of IFSs satisfying the hypotheses of the theorem,
then

(1.3) $\dim \pi(X_1 \times \dots \times X_d) = \min\{1, \dim X_1 + \dots + \dim X_d\}$.

The minimality condition in Theorem 1.4 is satisfied, for example, when there are
$d$ rationally independent numbers among the numbers $\lambda(f^{(i)}_j)$. Thus Theorem 1.4
(and its corollary) generalizes and extends the aforementioned results of Moreira [23],
Peres-Shmerkin [29] and Nazarov-Peres-Shmerkin [27]. We note that Theorem 1.4
does not make any assumptions about the linear or non-linear nature of the IFSs,
and neither does the proof, which provides a unified treatment of the known cases.
We also remark that Moreira [private communication] has shown that, for $d = 2$, the
minimality assumption holds automatically when one of the IFSs is not conjugated to
a linear IFS. In the linear case, however, the minimality assumption may fail to hold
and is necessary; see [29] for a discussion.

For self-similar sets and measures in $\mathbb{R}^d$ (see Section 9), we have:

Theorem 1.6. Let $\{f_i : i \in \Lambda\}$ be an iterated function system on $\mathbb{R}^d$ with the strong
separation condition consisting of similarities, and $\mu$ a self-similar measure on its at-
tractor. Let $O_i$ denote the orthogonal part of the similarity $f_i$ and suppose that:

Minimality assumption. The action (by right composition) of the semigroup gen-
erated by $O_i$ on $\Pi_{d,k}$ is topologically minimal, i.e. for some (equivalently any) $\pi \in \Pi_{d,k}$
the orbit

$\{\pi O_{i_1} \cdots O_{i_k} : i_1, \dots, i_k \in \Lambda\}$

is dense in $\Pi_{d,k}$.

Then for every $C^1$ map $g : \operatorname{supp} \mu \to \mathbb{R}^k$ without singular points, $g\mu$ is exact dimen-
sional and

$\dim(g\mu) = \min(k, \dim \mu)$.

It is well known that, under the strong separation condition, for a self-similar set of 
dimension a, the a-dimensional Hausdorff measure on X is equivalent to a self-similar 
measure. Therefore the theorem implies a version for sets: 

Corollary 1.7. If X is an attractor of an IFS satisfying the hypotheses of the theorem 
above, then for every $g \in C^1(X)$ without singular points,

$\dim g(X) = \min\{k, \dim X\}$.

Finally, our methods also apply to certain problems involving non-smooth maps. 
Recall that the biased Bernoulli convolution with contraction $0 < t < 1$ and weight
$0 < p < 1$ is the probability measure that is the distribution of the random real
number $\sum_{n=0}^{\infty} \pm t^n$, where the sign is chosen i.i.d. with probability $p, 1-p$. One may
view this as the image of the product measure $(p, 1-p)^{\mathbb{N}}$ on $\{+, -\}^{\mathbb{N}}$ under the Lipschitz
maps $\psi_t(x) = \sum x_n t^n$.
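
As a rough illustration of this definition, the sketch below draws one sample from a truncated Bernoulli convolution. The truncation to `n_terms` summands, the function name, and the convention that the sign $+$ is the one chosen with probability $p$ are assumptions made for the example.

```python
import random


def sample_bernoulli_convolution(t, p, n_terms=60, rng=random):
    """Return sum_{n=0}^{n_terms-1} s_n * t**n with i.i.d. signs s_n.

    Assumption: s_n = +1 with probability p and -1 with probability 1 - p.
    The infinite series is truncated after n_terms terms.
    """
    total = 0.0
    power = 1.0  # holds t**n
    for _ in range(n_terms):
        sign = 1.0 if rng.random() < p else -1.0
        total += sign * power
        power *= t
    return total


# Every sample lies in [-1/(1-t), 1/(1-t)].
samples = [sample_bernoulli_convolution(0.5, 0.3) for _ in range(1000)]
```

A histogram of many such samples approximates the measure at the scale of the truncation error, which is of order $t^{n\_terms}$.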

The following theorem may be inferred from deep existing results, but follows easily 
from our methods: 

Theorem 1.8. The lower Hausdorff dimension dim* Vp is lower semi- continuous in 

1.4. Local dynamics and continuity of dimension. The proofs of Theorems 1.3,
1.4, 1.6 consist of two independent parts. The first is a semicontinuity result for the
map $\pi \mapsto \dim_* \pi\mu$ when $\mu$ is a measure displaying a certain "local dynamics". Coupled
with the general result that $\dim_* \pi\mu$ has the expected dimension for almost-every $\pi$,
this provides an open dense set of projections which project to nearly the expected
dimension. The second part of the proof relies on some invariance of the measures
(and hence of the set of good projections) under a sufficiently large group to show that
this open set, being invariant, is in fact all of $\Pi_{d,k}$ or some large part of it. While
all related works utilize largeness of this action in one way or another, the continuity 
result is new and is perhaps the main technical innovation of this paper. We outline 
these results next. 

The "local dynamics" which we require of a measure $\mu$ is, briefly, that as one zooms
in to a typical point $x$, all the while re-scaling the measure, one sees a sequence of
measures which display stationary dynamics. More precisely, given a measure $\mu$ and
$x \in \operatorname{supp} \mu$, one can form a sequence of cubes $B_n$ descending to $x$. For example, given
an integer $b$ one can choose the $b$-adic cells containing $x$ (later it will be necessary
to work with more general cells; see Section 7). Form the sequence of measures $\mu^{x,n}$
obtained by restricting $\mu$ to $B_n$, normalizing, and re-scaling it back to the unit cube.
The sequence $\mu^{x,n}$ is sometimes called the scenery at $x$. Our assumption about $\mu$
will be that for $\mu$-typical $x$, the scenery sequence displays statistical regularity, i.e. is
generic for some distribution $P$ on measures, and $P$ is independent of $x$.
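
The restrict, normalize and re-scale steps can be sketched for a finitely supported measure on $[0, 1)$ as follows. The representation of a measure as a dictionary from atoms to masses and the names `cantor_measure` and `scenery` are assumptions for illustration; the example uses a finite-level approximation of the uniform measure on the middle-third Cantor set.

```python
import math
from fractions import Fraction
from itertools import product


def cantor_measure(m):
    """Level-m approximation: mass 2**-m on each left endpoint of a level-m Cantor interval."""
    mu = {}
    for digits in product((0, 2), repeat=m):
        x = sum(Fraction(d, 3 ** (i + 1)) for i, d in enumerate(digits))
        mu[x] = Fraction(1, 2 ** m)
    return mu


def scenery(mu, x, b, n):
    """Restrict mu to the level-n b-adic cell containing x, normalize, re-scale to [0, 1)."""
    scale = b ** n
    left = Fraction(math.floor(x * scale), scale)  # left endpoint of the cell
    cell = {y: w for y, w in mu.items() if left <= y < left + Fraction(1, scale)}
    total = sum(cell.values())
    return {(y - left) * scale: w / total for y, w in cell.items()}


# Zooming in once along 3-adic cells reproduces the coarser approximation.
zoomed = scenery(cantor_measure(3), Fraction(2, 3), 3, 1)
```

For this self-similar example every scenery measure looks the same, which is the simplest instance of the stationarity described above.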

The limiting distribution $P$ above is the distribution of a so-called CP-chain (or a
slight generalization of one), which were introduced by Furstenberg in [12], [13] for the
purpose of studying some related problems. A CP-chain is a measure-valued Markov
process $(\mu_n)_{n=1}^{\infty}$, in which $\mu_n$ are probability measures on the unit cube (or some other
fixed compact set), and, conditioned on $\mu_1$, the sequence $\mu_2, \mu_3, \dots$ is the scenery of
$\mu_1$ at a $\mu_1$-typical point. See Section 7. Although CP-chains are quite special objects,
in fact many measures that arise in conformal dynamics have CP-chains associated to 
them in a natural way, and are often equal to typical measures for CP-chains after 
slight distortion. 

Similar notions for measures and sets have been studied by many authors, mostly 
with the aim of classifying measures and sets by their limiting local behavior [TS, [H [2 
[20j [26j [251 [3] . See [16] for a systematic discussion of CP-chains and their relation to 
other models of "fractal" measures. 

Returning to dimension of projections, one might say that the discontinuity of $\dim \pi\mu$
in $\pi$ is a result of the infinitesimal nature of Hausdorff dimension. It is therefore
desirable to express, or at least bound, the dimension in terms of a finite-scale quantity.
For this purpose a useful quantity to consider is entropy: For a measure $\nu$ on $\mathbb{R}^d$ define
the $\rho$-scale entropy of $\nu$ by

$H_\rho(\nu) = -\int \log \nu(B_\rho(x)) \, d\nu(x)$.

This measures how "spread out" $\nu$ is, and its behavior as $\rho \to 0$ has been studied as
an alternative notion of dimension (this is so-called entropy or information dimension, 
which again behaves discontinuously under projections). Our key innovation is to 
observe that, in the presence of local dynamics, the (mean) behavior of this entropy at 
a fixed finite scale can be used to bound the dimension of projections of the measure. 

Theorem 1.9. Let $\mu$ be a measure on $\mathbb{R}^d$, fix an integer base $b \ge 2$ and let $\pi \in \Pi_{d,k}$.
Suppose that for $\mu$-a.e. $x$,

(1.4) $\liminf_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} H_{1/b}(\pi\mu^{x,n}) \ge \alpha$,

where $\mu^{x,n}$ are the scenery of $\mu$ at $x$ along $b$-adic cells. Then

$\dim_* \pi\mu \ge \frac{\alpha}{\log b} - \frac{C}{\log b}$,

where the constant $C$ depends only on $d, k$.

The use of $b$-adic cells here is somewhat arbitrary, and we can (and will) sometimes
use other filtrations.

Observe now that $H_\rho(\pi\nu)$ is (almost) jointly continuous in $\pi \in \Pi_{d,k}$ and $\nu$ a proba-
bility measure on the unit cube (actually, it is discontinuous at atomic measures, but
the error tends to zero as $\rho \to 0$). Hence for $b$ large enough, as $\pi' \to \pi$, if one replaces $\pi$
with $\pi'$ in (1.4), then the new averages also exceed $\alpha$ in the limit, and we obtain a lower
bound $\dim \pi'\mu > \alpha$ for $\pi'$ close enough to $\pi$. Also note that for measures displaying
good local dynamics, the limit (1.4) converges to the mean value of $H_{1/b}(\pi\nu)$, with
$\nu$ distributed according to the limiting CP-chain of $\mu$. Combining Theorem 1.9 with
some additional analysis leads to the following theorem.

Theorem 1.10. Let $P$ be the distribution of an ergodic $d$-dimensional CP-chain. Then
for every $k$ there is a lower semi-continuous function $E : \Pi_{d,k} \to \mathbb{R}$ such that:

(1) $E(\pi) = \min(k, \alpha)$ for almost every $\pi \in \Pi_{d,k}$, where $\alpha$ is the $P$-almost sure
dimension of measures in the chain (see Lemma 7.9).

(2) For a fixed $\pi \in \Pi_{d,k}$,

$\dim_* \pi\mu = E(\pi)$ for $P$-a.e. $\mu$.

(3) There is a set $M$ of measures with $P(M) = 1$, such that, for $\mu \in M$,

$\dim_* \pi\mu \ge E(\pi)$ for all $\pi \in \Pi_{d,k}$.

In fact, $E(\pi)$ is the limit, as $\rho \to 0$, of the mean values of the $H_\rho(\pi\nu)$, the mean
being over $\nu$ under the distribution $P$.

The following corollary is then immediate: 

Corollary 1.11. In the setting of Theorem 1.10, there exists a set $M$ of measures with
$P(M) = 1$, such that for every $\varepsilon > 0$ there is a dense open set $\mathcal{U}_\varepsilon \subseteq \Pi_{d,k}$ satisfying

$\dim_* \pi\mu > \min(k, \alpha) - \varepsilon$ for all $\pi \in \mathcal{U}_\varepsilon$, $\mu \in M$.

A weaker result for non-ergodic processes is available, see Theorem 8.3 below.

Differentiable maps are locally close to linear ones, and limits of averages along
sceneries at $x$ depend only on the local behavior of the measure near $x$. From this we
obtain results on non-linear images of measures:

Theorem 1.12. Let $P$ be the distribution of an ergodic CP-chain. Fix $\pi \in \Pi_{d,k}$. Then
for $P$-almost every $\mu$, the map $g \mapsto \dim_* g\mu$ is lower semi-continuous at $\pi$ in the $C^1$
topology.

Furthermore, the modulus of continuity is uniform in $\mu$: for every $\varepsilon > 0$ there is
a $\delta > 0$ so that for a.e. $\mu$, if $g \in C^1(\operatorname{supp} \mu, \mathbb{R}^k)$ and $\|g - \pi\|_{C^1} < \delta$ then $\dim_* g\mu >
\dim_* \pi\mu - \varepsilon$.

Theorem 1.13. Let $E : \Pi_{d,k} \to \mathbb{R}$ be the function associated to an ergodic CP-chain
as in Theorem 1.10 and let $\mu$ be a typical measure for the chain.^3 Then for every $C^1$ map
$g : \operatorname{supp} \mu \to \mathbb{R}^k$ without singular points,

In fact, the above theorem as well as other results involving smooth functions require
only differentiability, but for simplicity we present the proofs in the $C^1$ case.

A number of natural questions arise in connection with Theorem 1.10 and the almost-
everywhere nature of results for CP-chains. In particular, results which, for each $\pi$,
hold almost surely do not a priori hold almost surely for all $\pi$. Another issue we do not
resolve here is the behavior of the upper Hausdorff dimension. Some of these issues are 
addressed in [TB] , 

1.5. Outline of the paper. Section 2 introduces some general notation.

Section 3 recalls the notions of entropy and dimension and some of their properties.

In Section 4 we study measures on trees and obtain bounds on the image of such a
measure under a tree morphism.

In Section 5 we develop machinery for lifting geometric maps between Euclidean
spaces to morphisms between trees.

Section 6 discusses Bernoulli convolutions and Theorem 1.8.

In Section 7 we define (generalized) CP-chains.

Section 8 contains semicontinuity results for images of typical measures for CP-
chains.

In Sections 9, 10 and 11 we prove Theorems 1.6, 1.3 and 1.4 respectively.

2. General notation and conventions

$\mathbb{N} = \{1, 2, 3, \dots\}$. In a metric space, $B_r(x)$ denotes the closed ball of radius $r$ around
$x$. We endow $\mathbb{R}^d$ with the sup norm $\|x\| = \max\{|x_1|, \dots, |x_d|\}$ and the induced metric.

All measures are Borel probability measures and all sets and functions that we work
with are Borel unless otherwise noted. The family of Borel probability measures on a
metric space $X$ (with the Borel $\sigma$-algebra) will be denoted $\mathcal{P}(X)$.

Given a finite measure $\mu$ on some space and a measurable set $A$, we write

$\mu_A(B) = \frac{1}{\mu(A)} \mu(A \cap B)$.

This is the conditional probability of $\mu$ on $A$.

We use the standard "big O" notation for asymptotics: $x = O_p(y)$ means that
$x \le Cy$, where $C$ depends on the parameter $p$. Similarly $x = \Omega_p(y)$ means $y = O_p(x)$,
and $x = \Theta_p(y)$ means $x = O_p(y)$ and $y = O_p(x)$.

^3 A priori the function $E$ is defined only on $\Pi_{d,k}$. However it can be extended to all linear functions in
a straightforward way so that Theorem 1.10 still holds. Alternatively, one can identify $D_x g$ with the
projection onto the $k$-plane orthogonal to the kernel of $D_x g$.

For the reader's convenience, we summarize our main notation and typographical
conventions in the following table.

$d$                                              Dimension of the ambient Euclidean space.
$k$                                              Dimension of the range of a projection.
$\Pi_{d,k}$                                      Space of projections from $\mathbb{R}^d$ to $k$-dimensional subspaces.
$\pi$                                            Orthogonal projections.
$f, g, h$                                        Morphisms between trees (Section 4.1), differentiable maps.
$\alpha, \beta, \gamma$                          Dimension of fractal sets and measures.
$\mu, \nu, \eta$                                 Probability measures.
$P$                                              Probability distributions on large spaces (e.g. spaces of measures).
$\dim$                                           Hausdorff dimension of a set, exact dimension of a measure.
$\dim^*, \dim_*$                                 Upper/lower Hausdorff dimension of measures (Section 3.1).
$\overline{\dim}, \underline{\dim}$              Upper/lower pointwise dimension of a measure (Section 3.1).
$\overline{\dim}_e, \underline{\dim}_e$          Upper/lower entropy dimension of a measure (Section 3.3).
$H_\rho$                                         $\rho$-scale entropy of a measure (Section 3.3).
$H(\mu, \mathcal{P})$, $H(X)$                    Entropy of $\mu$ w.r.t. partition $\mathcal{P}$ [of random variable $X$] (Section 3.3).
$\Lambda, \Lambda_X$                             Symbol set of a tree [or of the tree $X$]; index set of an IFS.
$\mathcal{D}_b, \mathcal{D}_b(x)$                $b$-adic cell [containing $x$] (Section 3.2).
$X, Y$                                           Tree (Section 4.1) or attractors of an IFS (Sections 9, 11).
$x, y$                                           Points in tree or attractors of an IFS.
$a, b$                                           Finite words.
$[a]$                                            Cylinder sets in a tree.
$A^*$                                            Re-scaled version of $A$ (Section 7.2).
$T_A$                                            Re-scaling homothety, mapping $A$ to $A^*$ (Section 7.2).
$\mu_A$                                          Conditional measure on $A$ (Section 7.3).
$\mu^A$                                          Conditional measure on $A$, re-scaled to $A^*$ (Section 7.3).
$\Delta, \mathcal{E}$                            Partition operator and family of boxes (Section 7.4).
$\mathcal{U}, \mathcal{U}_\varepsilon$           Subsets (often open) of $\Pi_{d,k}$ or $C^1(\mathbb{R}^d, \mathbb{R}^k)$.


3. Dimension and entropy 

We denote the Hausdorff dimension of a set A by dim A. Falconer's books [5], [6] 
are good introductions to Hausdorff measure and dimension. In this section we collect 
some basic facts about dimension and entropy of measures. 

3.1. Hausdorff and local dimension of measures. Let $\mu$ be a Borel measure on a
metric space $X$. The upper and lower Hausdorff dimensions of $\mu$ are given by

$\dim^*(\mu) = \inf\{\dim(A) : \mu(X \setminus A) = 0\}$,
$\dim_*(\mu) = \inf\{\dim(A) : \mu(A) > 0\}$.

The upper and lower local dimensions of $\mu$ at a point $x$ are given by

$\overline{\dim}(\mu, x) = \limsup_{r \to 0} \frac{\log \mu(B_r(x))}{\log r}$,

$\underline{\dim}(\mu, x) = \liminf_{r \to 0} \frac{\log \mu(B_r(x))}{\log r}$.

Clearly $\underline{\dim}(\mu, x) \le \overline{\dim}(\mu, x)$. The local dimension of $\mu$ at $x$ exists if $\underline{\dim}(\mu, x) =
\overline{\dim}(\mu, x)$, and is equal to their common value. If the local dimension of $\mu$ exists and
is constant $\mu$-almost everywhere, $\mu$ is exact dimensional, and the almost sure local
dimension is denoted $\dim \mu$. Whenever we write $\dim \mu$, we are implicitly assuming
that $\mu$ is exact dimensional. Note that $\dim_* \mu = \dim^* \mu$ does not imply that $\mu$ is exact
dimensional.
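
As a sanity check on the definition of local dimension, the following sketch evaluates the ratio $\log \mu(B_r(x)) / \log r$ for a finite-level approximation of the uniform measure on the middle-third Cantor set, at radii $r = 3^{-n}$ coarser than the approximation level. The dictionary representation and the function names are assumptions for illustration; $B_r(x)$ is the closed ball, as in Section 2.

```python
import math
from fractions import Fraction
from itertools import product


def cantor_measure(m):
    """Level-m approximation: mass 2**-m on each left endpoint of a level-m Cantor interval."""
    mu = {}
    for digits in product((0, 2), repeat=m):
        x = sum(Fraction(d, 3 ** (i + 1)) for i, d in enumerate(digits))
        mu[x] = Fraction(1, 2 ** m)
    return mu


def ball_mass(mu, x, r):
    """Mass of the closed ball B_r(x) for a finitely supported measure on the line."""
    return sum(w for y, w in mu.items() if abs(y - x) <= r)


def local_dimension_ratio(mu, x, r):
    """The ratio log mu(B_r(x)) / log r whose limit as r -> 0 is the local dimension."""
    return math.log(float(ball_mass(mu, x, r))) / math.log(float(r))


mu = cantor_measure(6)
ratios = [local_dimension_ratio(mu, Fraction(0), Fraction(1, 3 ** n)) for n in range(1, 5)]
```

In this example the ball of radius $3^{-n}$ around $0$ carries mass $2^{-n}$, so each ratio equals $\log 2 / \log 3$, the dimension of the Cantor measure.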

We record a few basic facts about lower dimension.

Lemma 3.1. Let $\mu$ be a Borel measure on a metric space.

(1) $\dim_*(\mu) = \sup\{\alpha : \underline{\dim}(\mu, x) \ge \alpha \text{ for } \mu\text{-a.e. } x\}$.

(2) If $P$ is a distribution on measures and $\mu = \int \nu \, dP(\nu)$, then

$\dim_* \mu \ge \operatorname{essinf}_{\nu \sim P} \dim_* \nu$.

(3) If $\mu, \nu$ are equivalent measures (i.e. mutually absolutely continuous), then
$\dim_* \mu = \dim_* \nu$.

Proof. See e.g. [H Proposition 10.2] for the first assertion; the last two are easy conse-
quences of the definition. □

3.2. p-adic cells and regular filtrations. Given an integer base $p \ge 2$, we denote
by $\mathcal{D}_p$ the partition of $\mathbb{R}^d$ into cubes of the form $I_1 \times \dots \times I_d$ with $I_j = [\frac{k}{p}, \frac{k+1}{p})$. Note
that $\mathcal{D}_{p^k}$, $k = 1, 2, \dots$ form a refining sequence of partitions that separates points. A
cube in $\bigcup_{k=1}^{\infty} \mathcal{D}_{p^k}$ is called a $p$-adic cube.

A sequence $\mathcal{F} = (\mathcal{F}_n)_{n=1}^{\infty}$ of partitions of a region in $\mathbb{R}^d$ is $\rho$-regular if, for some
constant $C > 1$, every $B \in \mathcal{F}_n$ contains a ball of radius $\rho^n / C$ and is contained in a ball
of radius $C \cdot \rho^n$. For example, this assumption is satisfied by $\mathcal{F}_n = \mathcal{D}_{p^n}$ with $\rho = 1/p$.

For a partition $\mathcal{D}$ of $E \subseteq \mathbb{R}^d$ and $x \in E$, we write $\mathcal{D}(x)$ for the unique partition
element containing $x$.

The proof of the following can be found in [31, Theorem 15.3]:

Lemma 3.2. Let $\mu$ be a measure on $\mathbb{R}^d$ and $\mathcal{F} = (\mathcal{F}_n)_{n=1}^{\infty}$ a $\rho$-regular filtration. Then
for $\mu$-a.e. $x$,

$\overline{\dim}(\mu, x) = \limsup_{n \to \infty} \frac{\log \mu(\mathcal{F}_n(x))}{n \log \rho}$,

$\underline{\dim}(\mu, x) = \liminf_{n \to \infty} \frac{\log \mu(\mathcal{F}_n(x))}{n \log \rho}$.

3.3. Entropy and entropy dimension. The various notions of dimension aim to
quantify the degree to which a measure is "spread out". One can also quantify this
using entropy. Given a probability measure $\mu$ on a metric space $X$, the $r$-scale entropy
of $\mu$ is

$H_r(\mu) = -\int \log(\mu(B_r(x))) \, d\mu(x)$.

The upper and lower entropy dimensions of $\mu$ are defined as

$\overline{\dim}_e(\mu) = \limsup_{r \to 0} \frac{H_r(\mu)}{-\log r}$,

$\underline{\dim}_e(\mu) = \liminf_{r \to 0} \frac{H_r(\mu)}{-\log r}$.

Clearly $\underline{\dim}_e \mu \le \overline{\dim}_e \mu$.

Entropy dimensions can also be defined in terms of entropies of partitions. Recall 
that if $\mu$ is a probability measure and $\mathcal{Q}$ is a finite or countable partition, then

$H(\mu, \mathcal{Q}) = -\sum_{Q \in \mathcal{Q}} \mu(Q) \log \mu(Q)$,

with the convention that $0 \log 0 = 0$. This quantity is called the Shannon entropy
of the partition Q. When X is a random variable taking finitely many values, it 
induces a finite partition of the underlying probability space. We then write H{X) 
for the Shannon entropy of this partition, with respect to the associated probability 
measure. For the basic properties of Shannon entropy, and in particular the definition 
and properties of conditional entropy, see [4]. 
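Next, we specialize to $\mathbb{R}^d$.

Lemma 3.3. Let $\mu$ be a probability measure on $\mathbb{R}^d$ and $p \ge 2$. Then there exists a
constant $C = C(d, p)$ such that for all $k \in \mathbb{N}$,

$|H_{p^{-k}}(\mu) - H(\mu, \mathcal{D}_{p^k})| \le C$.

In particular,

$\overline{\dim}_e(\mu) = \limsup_{k \to \infty} \frac{H(\mu, \mathcal{D}_{p^k})}{k \log p}$,

$\underline{\dim}_e(\mu) = \liminf_{k \to \infty} \frac{H(\mu, \mathcal{D}_{p^k})}{k \log p}$.

Proof. This is proved for $p = 2$ in [30, Lemma 2.3]; the general case is exactly analogous. □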
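
The partition form of entropy dimension in Lemma 3.3 is easy to evaluate numerically. The sketch below, restricted to dimension one and to finitely supported measures (both assumptions for illustration, as are the function names), computes $H(\mu, \mathcal{D}_{p^k}) / (k \log p)$ for a finite-level approximation of the middle-third Cantor measure with $p = 3$.

```python
import math
from collections import defaultdict
from fractions import Fraction
from itertools import product


def cantor_measure(m):
    """Level-m approximation: mass 2**-m on each left endpoint of a level-m Cantor interval."""
    mu = {}
    for digits in product((0, 2), repeat=m):
        x = sum(Fraction(d, 3 ** (i + 1)) for i, d in enumerate(digits))
        mu[x] = Fraction(1, 2 ** m)
    return mu


def partition_entropy(mu, p, k):
    """Shannon entropy (natural log) of mu w.r.t. the cells [j/p**k, (j+1)/p**k)."""
    cells = defaultdict(Fraction)  # Fraction() is zero
    for y, w in mu.items():
        cells[math.floor(y * p ** k)] += w
    return -sum(float(w) * math.log(float(w)) for w in cells.values() if w > 0)


def entropy_dimension_estimate(mu, p, k):
    """The finite-scale ratio H(mu, D_{p^k}) / (k log p) from Lemma 3.3."""
    return partition_entropy(mu, p, k) / (k * math.log(p))


mu = cantor_measure(6)
estimates = [entropy_dimension_estimate(mu, 3, k) for k in range(1, 6)]
```

Here each triadic cell of level $k \le 6$ that meets the support carries mass $2^{-k}$, so every estimate equals $\log 2 / \log 3$ up to rounding.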

The following proposition summarizes some of the relations between the different 
notions of dimension. Proofs can be found in 

Proposition 3.4. Let $\mu$ be a measure on $\mathbb{R}^d$. Then:

(3.1) $\dim_*(\mu) \le \underline{\dim}_e(\mu) \le \overline{\dim}_e(\mu)$.

If $\mu$ is exact dimensional, then

(3.2) $\dim_*(\mu) = \dim^*(\mu) = \underline{\dim}_e(\mu) = \overline{\dim}_e(\mu) = \dim \mu$.

The next two lemmas establish some continuity properties of $H_r(\mu)$.

Lemma 3.5. Let $\mu$ be a probability measure on $\mathbb{R}^d$, and fix a constant $C > 0$. Then

$|H_r(\mu) - H_{Cr}(\mu)| = O_{C,d}(1)$ for all $r > 0$.

Proof. See [30, Lemma 2.3]. □

Lemma 3.6. Let $\mu$ be a probability measure on the unit ball $B_1(0)$, and let $\pi \in \Pi_{d,k}$.
Then for any function $g : \mathbb{R}^d \to \mathbb{R}^k$ such that $\sup_x \|D_x g - \pi\|_\infty < r$,

$|H_r(\pi\mu) - H_r(g\mu)| = O_{d,k}(1)$.

Proof. After a translation we can assume $g(0) = 0$. Then it is easy to see that

$\pi^{-1}(B_r(x)) \subseteq g^{-1}(B_{O_{d,k}(r)}(x))$

and similarly with $\pi$ and $g$ exchanged. The lemma now follows from Lemma 3.5. □

3.4. Behavior of measures under orthogonal projections. The family $\Pi_{d,k}$ of
orthogonal projections from $\mathbb{R}^d$ to its $k$-dimensional subspaces may be identified with
the Grassmannian of $k$-planes in $\mathbb{R}^d$, with $V \subseteq \mathbb{R}^d$ corresponding to the projection to
$V$. This endows $\Pi_{d,k}$ with a smooth structure, and hence a measure class (there is
also a natural measure, invariant under the action by the orthogonal group, but we do
not have use for it). Different projections have different images, but it is convenient
to identify them all with $\mathbb{R}^k$ via an affine change of coordinates, which we specify as
needed. Such an identification is harmless since it does not affect any of the notions of
dimension that we use.

If $E \subseteq \mathbb{R}^d$ is a Borel set, then the well-known projection theorem of Marstrand-
Mattila says that

$\dim(\pi(E)) = \min(k, \dim(E))$,

for almost every projection $\pi \in \Pi_{d,k}$. Hunt and Kaloshin [T^ established the analogous
results for dimensions of a measure [TTl Theorem 4.1]. In particular we shall use the
following:

Theorem 3.7. Let $\mu$ be an exact-dimensional measure on $\mathbb{R}^d$. Then for almost every
projection $\pi \in \Pi_{d,k}$ the projection $\pi\mu$ is exact dimensional, and

$\dim(\pi\mu) = \min(k, \dim \mu)$.

Corollary 3.8. Let $\mu$ be an exact-dimensional probability measure on $\mathbb{R}^d$. Then for
almost every $\pi \in \Pi_{d,k}$,

(3.3) $$\lim_{r \to 0} \frac{H_r(\pi\mu)}{-\log r} = \min(k, \dim\mu).$$

Proof. This follows from Proposition 3.4 and Theorem 3.7. □

If $\mu$ is a measure on $\mathbb{R}^d$, then for every $\pi \in \Pi_{d,k}$ and $x \in \operatorname{supp}(\mu)$ we have
$\dim(\pi\mu, \pi x) \le \dim(\mu, x)$. In particular, if $\dim\mu = \alpha$, then $\dim(\pi\mu, y) \le \alpha$ for $(\pi\mu)$-a.e.
$y$. Also, $\dim(\mu, x) \le d$ almost everywhere.

Finally, combining the above with Lemma 3.1(1), we have:

Corollary 3.9. If $\mu$ is an exact-dimensional measure on $\mathbb{R}^d$ and $f : \mathbb{R}^d \to \mathbb{R}^k$ is a
differentiable map, then in order to prove that

$$\dim f\mu = \min\{k, \dim\mu\},$$

it suffices to prove that

$$\dim_* f\mu \ge \min\{k, \dim\mu\}.$$

4. Trees, local entropy and dimension 

In this section we use entropy and martingale methods to bound from below the
dimension of the image $f\mu$ of a measure $\mu$ under a map $f$. While simple, the result
appears to be new; it may be viewed as a variant of the relative Shannon-McMillan-Breiman
theorem with stationarity replaced by an assumption of convergence of certain averages,
and with very few requirements of the "factor" map. It may also be viewed as a relative
version of [13, Theorem 2.1]. Here we formulate the result for morphisms between trees.
In the next section we discuss how to lift more general maps to such morphisms.

4.1. Trees and tree morphisms. A tree is a closed subset $X \subseteq \Lambda^{\mathbb{N}}$. Here $\Lambda$ is a
finite set called the alphabet or the symbol set. We usually do not specify the symbol
set of a tree and write it generically as $\Lambda$; if we wish to be specific we write $\Lambda_X$, $\Lambda_Y$,
etc.

A tree is a compact metrizable totally disconnected space. A basis of closed and
open sets of $\Lambda^{\mathbb{N}}$ (and, in the relative topology, for $X$) is provided by the cylinder sets.
An $n$-cylinder is a set of the form

$$[a] = \{x \in \Lambda^{\mathbb{N}} : x_1 \ldots x_n = a\}$$

for $a \in \Lambda^n$, and an $n$-cylinder in a tree $X$ is the intersection of $X$ with such a set.

The cylinders form a countable family which is partially ordered by inclusion, and 
this order structure determines X up to isomorphism (see below for the definition of a 
tree morphism). Conversely, any countable partially ordered set satisfying the obvious 
axioms gives rise to a tree, and we shall sometimes represent a tree in this way. 

There is a closely related representation of trees as sets of words. Write $\Lambda^* =
\bigcup_{n=0}^{\infty} \Lambda^n$ (we write $\emptyset$ for the empty word). A tree $X \subseteq \Lambda^{\mathbb{N}}$ is characterized by the set

$$\{a \in \Lambda^* : [a] \cap X \ne \emptyset\}.$$

In this representation we call the sequence $a \in \Lambda^n$ a node of the tree. Its length is $n$
(denoted $|a|$). If $b \in \Lambda$ and $ab$ is in the tree then $ab$ is the child of $a$ and $a$ is the parent
of $ab$. We shall sometimes use the notation

$$a_1^k = a_1 a_2 \ldots a_k$$

to represent the initial $k$-segment of a longer word $a$.

Definition 4.1. If $X, Y$ are trees then a morphism is a map $f : X \to Y$ that maps
$n$-cylinders into $n$-cylinders, i.e. for all $a \in \Lambda_X^n$ there exists $b \in \Lambda_Y^n$ such that $f[a] \subseteq [b]$.

In the symbolic representation of trees this corresponds to a map $g : \Lambda_X^* \to \Lambda_Y^*$
satisfying $g(a_1 \ldots a_n) = g(a_1 \ldots a_{n-1})b$ for some $b \in \Lambda_Y$.

4.2. Local entropy averages and mass decay. Let $\mu$ be a Borel probability measure
on a tree $X$. For $x \in X$ denote by $\mu(\cdot \mid x_1^n)$ the conditional measure on the symbol set
$\Lambda$ given by

$$\mu(x_{n+1} \mid x_1^n) = \frac{\mu[x_1^{n+1}]}{\mu[x_1^n]}.$$

This is defined only when $\mu[x_1^n] > 0$. The $n$-th information function is

$$I_n(x) = -\log \mu(x_n \mid x_1^{n-1}).$$

Thus for $x \in X$,

(4.1) $$-\log \mu[x_1 \ldots x_n] = \sum_{k=1}^{n} I_k(x).$$

Let $X_n$ be the random variable given by projection from $X$ to the $n$-th coordinate
and $\mathcal{F}_n$ the $\sigma$-algebra generated by $X_1, X_2, \ldots, X_n$, i.e. by the $n$-cylinders. Then

$$H(X_{n+1} \mid x_1^n) = \mathbb{E}(I_{n+1} \mid \mathcal{F}_n)(x).$$

Lemma 4.2. For $\mu$-a.e. $x$,

$$-\frac{1}{N} \log \mu[x_1^N] - \frac{1}{N} \sum_{n=1}^{N} H(X_n \mid x_1^{n-1}) \to 0.$$

Proof. Using (4.1) we can write this expression as

$$\frac{1}{N} \sum_{n=1}^{N} \big(I_n - \mathbb{E}(I_n \mid \mathcal{F}_{n-1})\big)(x).$$

This is an average of uniformly $L^2$-bounded martingale differences, so by the Law of
Large Numbers for martingale differences (see e.g. [HI Theorem 3 in Chapter 9]), it
converges to $0$ a.e. □
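As an illustration only (it plays no role in the argument), the identity (4.1) and the convergence in Lemma 4.2 can be checked numerically. The sketch below assumes a hypothetical Markov measure on the full binary tree; the names `INIT`, `TRANS` and all function names are ours, not the paper's.

```python
import math
import random

# Hypothetical example: a Markov measure on the full binary tree {0,1}^N.
INIT = [0.5, 0.5]                 # distribution of the first symbol
TRANS = [[0.9, 0.1], [0.4, 0.6]]  # TRANS[a][b] = mu(x_{n+1} = b | x_n = a)

def cond_dist(prefix):
    """Conditional measure mu(. | x_1^n) on the alphabet, given the prefix x_1^n."""
    return INIT if not prefix else TRANS[prefix[-1]]

def information(prefix, symbol):
    """Information function: -log mu(symbol | prefix)."""
    return -math.log(cond_dist(prefix)[symbol])

def entropy(dist):
    """Shannon entropy of a finite distribution (natural logarithm)."""
    return -sum(p * math.log(p) for p in dist if p > 0)

def cylinder_mass(word):
    """mu[x_1 ... x_n], as a product of conditional probabilities."""
    mass = 1.0
    for n, s in enumerate(word):
        mass *= cond_dist(word[:n])[s]
    return mass

def sample(n, rng):
    """Draw the first n symbols of a mu-random point."""
    word = []
    for _ in range(n):
        word.append(rng.choices([0, 1], weights=cond_dist(word))[0])
    return word

def averages(word):
    """Return (-(1/N) log mu[x_1^N], (1/N) sum_n H(X_n | x_1^{n-1}))."""
    n = len(word)
    info = sum(information(word[:k], word[k]) for k in range(n))
    ent = sum(entropy(cond_dist(word[:k])) for k in range(n))
    return info / n, ent / n
```

For a long sample the two averages returned by `averages` should be close, as Lemma 4.2 predicts; the Markov structure is used only to keep the conditional measures easy to compute.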

Now suppose $f : X \to Y$ is a tree morphism, and denote by $f$ also the induced
symbolic map $\Lambda_X^* \to \Lambda_Y^*$. Let $\nu = f\mu$, define the conditional measures $\nu(\cdot \mid y_1^n)$, $y_1^n \in \Lambda_Y^n$,
as above, and let $Y_n$ denote the coordinate functions on $Y$. Since $f$ is a morphism, we
also have the conditional measures on $\Lambda_Y$ given by

$$\nu(b \mid x_1^{n-1}) = \frac{\mu(\{x' \in [x_1^{n-1}] : f(x')_n = b\})}{\mu[x_1^{n-1}]}, \quad b \in \Lambda_Y,$$

and we have the corresponding entropy

$$H(Y_n \mid x_1^{n-1}).$$

Note that $\nu(\cdot \mid x_1^{n-1})$ can also be thought of as the push-forward $f\mu(\cdot \mid x_1^{n-1})$, which is
well defined as a measure on $\Lambda_Y$ because $f$ is a tree morphism.

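Theorem 4.3. Suppose that

$$\liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} H(Y_n \mid x_1^{n-1}) \ge \alpha$$

for $\mu$-a.e. $x$. Then for $\nu$-a.e. $y$ we have

$$\liminf_{N\to\infty} -\frac{1}{N} \log \nu[y_1^N] \ge \alpha.$$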

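Proof. It is convenient to identify $(Y, \nu)$ with a sub-$\sigma$-algebra $\mathcal{G}$ of $(X, \mathcal{F}, \mu)$ and think of
$Y_n$ as random variables on $X$. We write $\mathcal{G}_n$ for the $\sigma$-algebra generated by $Y_1, \ldots, Y_n$,
so $\mathcal{G} = \bigvee \mathcal{G}_n$. Note that $\mathcal{G}_n \subseteq \mathcal{F}_n$.

By the lemma above, it suffices to show that

(4.2) $$\liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} H(Y_n \mid y_1^{n-1}) \ge \alpha$$

$\mu$-a.e. Define $h_n : X \to \mathbb{R}$ by

$$h_n(x) = H(Y_n \mid x_1^{n-1}).$$

Since $\nu(\cdot \mid y_1^{n-1})$ is the mean of the distributions $\nu(\cdot \mid x_1^{n-1})$ over all $x \in f^{-1}y$, the
concavity of the entropy function in the distribution implies

$$H(Y_n \mid y_1^{n-1}) \ge \mathbb{E}(h_n \mid \mathcal{G}_{n-1}),$$

and both are $\mathcal{G}_{n-1}$-measurable. On the other hand,

$$\frac{1}{N} \sum_{n=1}^{N} \big(\mathbb{E}(h_n \mid \mathcal{G}_{n-1}) - \mathbb{E}(h_n \mid \mathcal{G})\big) \to 0$$

since these are averages of bounded martingale differences. By the hypothesis,

$$\liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} h_n \ge \alpha,$$

so, conditioning on $\mathcal{G}$,

$$\liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} \mathbb{E}(h_n \mid \mathcal{G}) \ge \alpha.$$

Combining this with the two previous inequalities, we have

$$\liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} H(Y_n \mid y_1^{n-1}) \ge \liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} \mathbb{E}(h_n \mid \mathcal{G}) \ge \alpha,$$

which is what we wanted. □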

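4.3. Metric trees. For $0 < \rho < 1$, a $\rho$-tree is a tree $X$ together with the compatible
metric

$$d_\rho(x, y) = \rho^{\min\{n \ge 0 \,:\, x_{n+1} \ne y_{n+1}\}}.$$

Note that if $X$ is a $\rho$-tree and $Y$ is a $\tau$-tree then a tree morphism $X \to Y$ is $(\log\tau/\log\rho)$-Hölder.
In particular if $\tau = \rho$ then it is Lipschitz.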
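For concreteness, here is a small Python sketch of a tree metric of this kind. It assumes the convention that the distance between two distinct sequences is $\rho$ raised to the length of their longest common prefix (so that an $n$-cylinder has diameter $\rho^n$), and it models infinite sequences by finite tuples of equal length; both choices, and the function names, are ours.

```python
import math

def common_prefix_length(x, y):
    """Number of initial symbols on which the words x and y agree."""
    n = 0
    for a, b in zip(x, y):
        if a != b:
            break
        n += 1
    return n

def d_rho(x, y, rho):
    """Assumed tree metric: rho ** (length of the longest common prefix)."""
    if x == y:
        return 0.0
    return rho ** common_prefix_length(x, y)

def cylinder_depth(r, rho):
    """Smallest k with rho**k <= r; the closed ball of radius r is then a k-cylinder."""
    return max(0, math.ceil(math.log(r) / math.log(rho)))
```

With this convention a closed ball of radius $r$ around $x$ consists of the points that share the first `cylinder_depth(r, rho)` symbols with $x$, and the metric satisfies the ultrametric inequality.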

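In a $\rho$-tree the diameter of an $n$-cylinder is $\rho^n$, and

$$B_r(x) = [x_1 \ldots x_k], \quad \text{where } k = \left\lceil \frac{\log r}{\log \rho} \right\rceil.$$

Thus if $\mu$ is a measure on $X$ then

$$\liminf_{r \to 0} \frac{\log \mu(B_r(x))}{\log r} = \liminf_{n\to\infty} \frac{\log \mu[x_1 \ldots x_n]}{n \log \rho},$$

and likewise for $\limsup$. In particular, Lemma 3.1(1) yields

Lemma 4.4. If $\mu$ is a measure on a $\rho$-tree $X$ such that

$$\liminf_{n\to\infty} \left( -\frac{\log \mu([x_1 \ldots x_n])}{n} \right) \ge \alpha \quad \mu\text{-a.e.},$$

then $\dim_* \mu \ge \alpha / \log(1/\rho)$.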

The metric we choose for a tree is somewhat arbitrary, and we may change it at our 
convenience, but one must note that this leads to a re-scaling of dimensions. 

Recall the definition of $H_\rho(\mu)$ for a measure $\mu$ on a metric space (Section 3.3). In
the case of a $\rho$-tree $X$ with a probability measure $\mu$ we see, writing again $X_n$ for the
coordinate functions, that

$$H_\rho(\mu) = H(X_1),$$

and more generally,

$$H_{\rho^{n+1}}(\mu_{[x_1 \ldots x_n]}) = H(X_{n+1} \mid x_1 \ldots x_n),$$

where $\mu_A = \frac{1}{\mu(A)} \mu|_A$ as usual. If $f : X \to Y$ is a morphism of $\rho$-trees and $\nu = f\mu$, we
have

$$H_{\rho^{n+1}}(f\mu_{[x_1 \ldots x_n]}) = H(Y_{n+1} \mid x_1 \ldots x_n).$$

We can thus re-formulate Theorem 4.3 as follows:

Theorem 4.5. Let $X, Y$ be $\rho$-trees and $f : X \to Y$ a tree morphism, and let $\mu$ be a
probability measure on $X$. If

$$\liminf_{N\to\infty} \frac{1}{N} \sum_{n=1}^{N} H_{\rho^{n+1}}(f\mu_{[x_1 \ldots x_n]}) \ge \alpha$$

for $\mu$-a.e. $x$, then

$$\dim_* f\mu \ge \frac{\alpha}{\log(1/\rho)}.$$

The significance of this theorem is that one obtains a lower bound on the dimension
of the image measure $f\mu$, in terms of an asymptotic property of the measure $\mu$ in
the domain. Because the map is not one-to-one, a lot of structure is destroyed in
the passage from $\mu$ to $f\mu$, and it may be impossible to find enough structure in $f\mu$
to analyze it directly. Instead, this theorem allows one to use structural information
about $\mu$ and the way $f$ acts on cylinder sets in order to bound $\dim_* f\mu$ from below.

5. Lifting maps to tree morphisms 

In order to make use of the last section's results it is necessary to lift topological 
maps (between trees, Euclidean domains, or a mixture of the two) to tree morphisms. 
This technical section provides the tools for this. 

5.1. Base-$p$ representation. Given an integer $p \ge 2$ one can represent $[0,1]$ using a $p$-regular
tree: $u : \{0, \ldots, p-1\}^{\mathbb{N}} \to [0,1]$ is given by

$$u(x) = \sum_{n=1}^{\infty} x_n p^{-n}.$$

We shall always give the full tree $\{0, \ldots, p-1\}^{\mathbb{N}}$ the metric $d_{1/p}$, under which $u$ becomes
1-Lipschitz. Similarly the base-$p$ representation of the cube $[0,1]^d$ is given by the tree
$(\{0, \ldots, p-1\}^d)^{\mathbb{N}}$ with the map $(x^1, \ldots, x^d) \mapsto (u(x^1), \ldots, u(x^d))$ and the metric $d_{1/p}$,
under which this map is again 1-Lipschitz with respect to the $\|\cdot\|_\infty$-norm on the range.
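The coding map can be sketched in a few lines of Python. The sketch truncates the expansion to finitely many digits, and the Lipschitz check assumes that the tree distance between two sequences is $p^{-m}$, where $m$ is the length of their common prefix; the function names are ours.

```python
def u(digits, p):
    """Truncated base-p expansion: sum over n >= 1 of x_n * p**(-n) for the given digits."""
    return sum(d * p ** -(n + 1) for n, d in enumerate(digits))

def prefix_length(x, y):
    """Number of initial digits on which x and y agree."""
    n = 0
    for a, b in zip(x, y):
        if a != b:
            break
        n += 1
    return n

def lipschitz_ok(x, y, p):
    """Check |u(x) - u(y)| <= p**(-m), with m the common prefix length (assumed metric)."""
    return abs(u(x, p) - u(y, p)) <= p ** -prefix_length(x, y)
```

For instance, `u([1, 0, 1], 2)` evaluates the binary expansion $0.101_2$, and two decimal digit strings that agree in their first two places are mapped to points at distance at most $10^{-2}$.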

5.2. Faithful maps. Below we introduce a class of maps which do not distort dimen- 
sion very much. 

Definition 5.1. Let $X$ be a $\rho$-tree. A map $f : X \to \mathbb{R}^d$ is $C$-faithful if for each $n$ and
each $a \in \Lambda^n$ the following conditions hold:

(1) Multiplicity: No point in $f[a]$ is covered by more than $C$ sets $f[ab]$, $b \in \Lambda$.

(2) Decay: $f[a]$ contains a ball of radius $(C^{-1}\rho)^n$ and is contained in a ball of
radius $(C\rho)^n$.

For example, the base-$p$ coding of $[0,1]^d$ is $C$-faithful for $C = 2^d$.
The second condition in the definition implies that a $C$-faithful map of a $\rho$-tree is
$\left(1 - \frac{\log C}{\log(1/\rho)}\right)$-Hölder. Therefore, if $\mu$ is a measure on $X$, then

$$\dim_*(f\mu) \le \left(1 - \frac{\log C}{\log(1/\rho)}\right)^{-1} \dim_* \mu.$$

In applications, $C$ will be independent of $\rho$ and we will be free to choose $\rho$ to be very
small (for example, representing points in $[0,1]^d$ using a large base); then the bound
above approaches $\dim_* \mu$. Similarly, the following provides a lower bound:

Proposition 5.2. Let $\mu$ be a measure on a $\rho$-tree $X$ and suppose $f : X \to \mathbb{R}^d$ is
$C$-faithful. Then

$$\dim_*(f\mu) \ge \dim_*(\mu) - \frac{O_{C,d}(1)}{\log(1/\rho)}.$$



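Proof. We use the characterization of $\dim_*$ given in Lemma 3.1(1). Given $\varepsilon > 0$,
Egorov's theorem yields a closed set $E_\varepsilon \subseteq X$ of measure at least $1 - \varepsilon$ such that

$$\liminf_{r \downarrow 0} \frac{\log \mu|_{E_\varepsilon}(B_r(x))}{\log r} \ge \dim_*(\mu) \quad \text{uniformly in } x \in E_\varepsilon.$$

Since it is enough to prove the desired result for $f(\mu|_{E_\varepsilon})$ for each $\varepsilon > 0$, we can assume
without loss of generality that there is uniformity already for $\mu$ and, in particular, there
is $N \in \mathbb{N}$ such that

(5.1) $$\mu[a] \le \rho^{(\dim_*(\mu) - \varepsilon) n}$$

whenever $a \in \Lambda^n$ with $n \ge N$.

Fix $x \in X$ and write $y = f(x)$. Pick $n \ge N$, and let

$$\Phi = \{a \in \Lambda^n : f[a] \cap B_{\rho^n}(y) \ne \emptyset\}.$$

By the decay hypothesis, each $f[a]$, $a \in \Phi$, contains a ball of radius $(C^{-1}\rho)^n$ and is
contained in a ball of radius $(C\rho)^n$. In particular, $f[a] \subseteq B_{(1+2C^n)\rho^n}(y)$. On the other
hand, by the multiplicity assumption, no point can be covered by more than $C^n$ of the
sets $f[a]$, $a \in \Phi$. Hence, writing $\lambda$ for Lebesgue measure on $\mathbb{R}^d$ and $\lambda(B_r)$ for the volume of a ball of radius $r$, we have

$$|\Phi| \cdot \lambda\big(B_{(C^{-1}\rho)^n}\big) \le C^n \lambda\Big(\bigcup_{a \in \Phi} f[a]\Big) \le C^n \lambda\big(B_{(1+2C^n)\rho^n}\big).$$

Therefore $|\Phi| \le \exp(O_{C,d}(n))$, and using (5.1) we deduce that

$$(f\mu)(B_{\rho^n}(y)) \le |\Phi| \max_{a \in \Phi} \mu[a] \le \rho^{\,n\left(\dim_*(\mu) - \varepsilon - \frac{O_{C,d}(1)}{\log(1/\rho)}\right)}.$$

Letting $n \to \infty$ and then $\varepsilon \to 0$ we conclude that

$$\dim(f\mu, y) \ge \dim_*(\mu) - \frac{O_{C,d}(1)}{\log(1/\rho)}.$$

In light of Lemma 3.1(1), this yields the desired result. □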
Note that if $X$ is a $\rho$-tree then for $a \in \Lambda^n$,

$$H_{\rho^{n+1}}(\mu_{[a]}) = -\sum_{b \in \Lambda} \mu_{[a]}[ab] \log \mu_{[a]}[ab],$$

i.e. this is the Shannon entropy of $\mu_{[a]}$ with respect to the partition induced from $a$'s
children.

The importance of the following estimate is that it is independent of p. 



Proposition 5.3. Let $\mu$ be a measure on a $\rho$-tree $X$ and suppose $f : X \to \mathbb{R}^k$ is
$C$-faithful. Then for any $n$-cylinder $[a]$,

$$|H_{\rho^{n+1}}(\mu_{[a]}) - H_{\rho^{n+1}}(f\mu_{[a]})| \le O_{C,k}(1).$$



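Proof. Since $f|_{[a]}$ is a $C$-faithful map on the $\rho$-tree $[a]$ (with the re-scaled metric
$\rho^{-n} d_\rho(\cdot, \cdot)$) we may without loss of generality assume that $n = 0$, $a$ = empty word, so
we must prove

$$|H_\rho(\mu) - H_\rho(f\mu)| \le O_{C,k}(1).$$

Notice that

$$H_\rho(\mu) = -\int \log(\mu[x_1]) \, d\mu(x),$$
$$H_\rho(f\mu) = -\int \log \mu(B^f_\rho(x)) \, d\mu(x),$$

where

$$B^f_\rho(x) = f^{-1}(B_\rho(f(x))).$$

By the decay assumption in Definition 5.1, $f[x_1]$ is contained in a ball of radius $C\rho$,
and therefore $[x_1] \subseteq B^f_{2C\rho}(x)$, whence $H_{2C\rho}(f\mu) \le H_\rho(\mu)$. It then follows from Lemma
3.5 that

$$H_\rho(f\mu) - H_\rho(\mu) \le O_{C,k}(1).$$

For the other inequality, let

$$\Lambda(a) = \{b \in \Lambda : \operatorname{dist}(f[a], f[b]) \le \rho\}.$$

A volume argument like the one in the proof of Proposition 5.2 yields that each $b \in \Lambda$
belongs to at most $O_{C,k}(1)$ of the sets $\Lambda(a)$. Since clearly

$$B^f_\rho(x) \subseteq \bigcup_{b \in \Lambda(x_1)} [b],$$

we can estimate

$$H_\rho(\mu) - H_\rho(f\mu) = \int \log \frac{\mu B^f_\rho(x)}{\mu[x_1]} \, d\mu(x) \le \int \frac{\mu B^f_\rho(x)}{\mu[x_1]} \, d\mu(x) \le \int \frac{\sum_{b \in \Lambda(x_1)} \mu[b]}{\mu[x_1]} \, d\mu(x) = \sum_{a \in \Lambda} \sum_{b \in \Lambda(a)} \mu[b] \le O_{C,k}(1).$$

□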



5.3. Lifting maps to tree morphisms. The following technical result decomposes
a map into a tree morphism, which is easier to analyze, and a faithful map, which by
Proposition 5.2 has little effect on dimension.



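Theorem 5.4. Let $X$ be a $\rho$-tree and $f : X \to \mathbb{R}^k$ an $L$-Lipschitz map. Then there is
a commutative diagram

$$X \xrightarrow{\;g\;} Y \xrightarrow{\;h\;} \mathbb{R}^k, \qquad f = h g,$$

where:

(1) $Y$ is a $\rho$-tree.

(2) $g$ is a tree morphism.

(3) $h$ is $O_{k,L}(1)$-faithful.

(4) If $\mu$ is a measure on $X$, then for any $n$-cylinder $[a] \subseteq X$,

$$|H_{\rho^{n+1}}(f\mu_{[a]}) - H_{\rho^{n+1}}(g\mu_{[a]})| = O_{k,L}(1).$$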

Proof. We first note that by rescaling the metric on the range $\mathbb{R}^k$, we may assume that
$f$ is 1-Lipschitz. This rescaling affects the implicit constants in the $O$ notation, but as
we allow them to depend on $L$, we obtain an equivalent statement.

The construction of Y and the associated maps consists of two parts. 

Step 1: construction of $Y$ and $h$. Since $f$ is 1-Lipschitz and $\operatorname{diam} X = 1$, we may
assume that the image is contained in $[0,1]^k$. Note that the cube $Q = [0,1]^k$ has the
property that, given $N$, it can be covered by $2^k N^k$ closed cubes $Q_0, \ldots, Q_{2^k N^k - 1}$
such that

• Each $Q_i$ has side length $1/N$,

• No point in $Q$ is covered by more than $2^k + 1$ of the cubes $Q_i$,

• If $Q' \subseteq Q$ is a cube of side length $< 1/(2N)$ then $Q' \subseteq Q_i$ for some $i$.

The same holds for any other cube, with side lengths scaled appropriately.

For example, for $k = 1$ cover $[0,1]$ by $2N$ intervals of length $1/N$ starting at the
rational points $i/2N$.
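The $k = 1$ example can be checked directly. The following Python sketch uses exact rational arithmetic; it assumes the intervals are $[i/2N,\, i/2N + 1/N]$ for $i = 0, \ldots, 2N-1$ (so in this sketch the last interval extends slightly past $1$), and the function names are ours.

```python
from fractions import Fraction

def cover(N):
    """Intervals [i/2N, i/2N + 1/N], i = 0..2N-1 (assumed form of the k = 1 example)."""
    return [(Fraction(i, 2 * N), Fraction(i, 2 * N) + Fraction(1, N))
            for i in range(2 * N)]

def multiplicity(x, intervals):
    """Number of cover intervals containing the point x."""
    return sum(1 for a, b in intervals if a <= x <= b)

def containing(a, b, intervals):
    """Indices of the cover intervals that contain the whole interval [a, b]."""
    return [i for i, (lo, hi) in enumerate(intervals) if lo <= a and b <= hi]
```

On a grid of test points one can confirm that no point lies in more than $2^1 + 1 = 3$ intervals, and that every sub-interval of $[0,1]$ shorter than $1/(2N)$ lies inside a single interval of the cover.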

Let $\{N_n\}$ be an integer sequence taking values in $\{\lfloor \rho^{-1} \rfloor, \lfloor \rho^{-1} \rfloor + 1\}$, such that for
$P_n = \prod_{i=1}^{n} N_i$ we have $\tfrac{1}{2} \le \rho^n P_n \le 1$ (if $1/\rho \in \mathbb{N}$ we can take $N_n = 1/\rho$ and then
$P_n = \rho^{-n}$).

Let $Y$ be the tree such that each vertex of level $n$ has $2^k N_{n+1}^k$ offspring, numbered
$0, \ldots, 2^k N_{n+1}^k - 1$ (when $1/\rho \in \mathbb{N}$ this is a regular tree).

We inductively construct a map $h$ which assigns to each cylinder set $[a] \subseteq Y$ a
cube $h[a] \subseteq [0,2]^k$ of side length $2P_{|a|}^{-1}$. We start by $h[\emptyset] = [0,2]^k$. Suppose that $Q =
h[y_1 \ldots y_n]$ has been defined and is a cube of side length $2P_n^{-1}$. Let $Q_0, \ldots, Q_{2^k N_{n+1}^k - 1}$
be the sub-cubes of $Q$ with properties analogous to those listed above for the unit cube.
For $y \in \{0, \ldots, 2^k N_{n+1}^k - 1\}$ set $h[y_1 \ldots y_n y] = Q_y$.

Finally, this defines $h$ by

$$\{h(y)\} = \bigcap_{n=1}^{\infty} h[y_1 \ldots y_n].$$

It is easy to see that, since $h$ respects inclusion for cylinder sets, the intersection on the
right-hand side is a single point. From the construction it is easy to check that $h$ is
$O_k(1)$-faithful.

Step 2: defining the morphism $g : X \to Y$. It is more convenient to work with
the symbolic representation: we define a morphism $g : \Lambda_X^* \to \Lambda_Y^*$ so that $f[a] \subseteq h[g(a)]$.
This clearly implies that $f = hg$.

We proceed by induction on the word length. Start with $g(\emptyset) = \emptyset$ (corresponding to
$g(X) \subseteq Y$). Suppose we have defined $g(x_1 \ldots x_n) = y_1 \ldots y_n$ and the cube $h[y_1 \ldots y_n]$,
which has side length $2P_n^{-1}$, contains $f[x_1 \ldots x_n]$. Since $f$ is 1-Lipschitz, for each
$a \in \Lambda_X$ the set $f[x_1 \ldots x_n a]$ is contained in a cube of side length $\rho^{n+1} \le P_{n+1}^{-1}$, i.e.
$1/(2N_{n+1})$ times the side length of $h[y_1 \ldots y_n]$. Thus by construction of $h$ there is at
least one $b \in \{0, \ldots, 2^k N_{n+1}^k - 1\}$ such that the cube $h[y_1 \ldots y_n b]$ contains $f[x_1 \ldots x_n a]$;
set $g(x_1 \ldots x_n a) = y_1 \ldots y_n b$.

This completes the construction of $Y$ and of $g, h$.

Finally, the entropy estimate is a consequence of the commutativity of the diagram,
the faithfulness of $h$ and Proposition 5.3. □

6. Semicontinuity of dimension: Bernoulli convolutions

As a warm-up we demonstrate in this section how the methods introduced so far can
be used to obtain semi-continuity of the Hausdorff dimension of Bernoulli convolutions
in the parameter space (Theorem 1.8). Recall that for $0 < t, p < 1$ the Bernoulli
convolution $\nu_t^p$ is the distribution of the random real number

$$\sum_{n=0}^{\infty} \pm t^n,$$

where the signs are chosen i.i.d. with marginal distribution $(p, 1-p)$. The parameter
$t$ is called the contraction ratio.
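To make the definition concrete, the following Python sketch draws samples from a truncated version of this random series. It assumes that the sign $+1$ is the one taken with probability $p$, and it truncates the sum after finitely many terms; the function names and default parameters are ours.

```python
import random

def sample_bernoulli_convolution(t, p, terms, rng):
    """One sample of sum_{n=0}^{terms-1} s_n t^n, with s_n = +1 w.p. p and -1 otherwise."""
    total, power = 0.0, 1.0
    for _ in range(terms):
        sign = 1.0 if rng.random() < p else -1.0
        total += sign * power
        power *= t
    return total

def empirical_mean(t, p, terms=60, samples=20000, seed=0):
    """Average of independent truncated samples, using a seeded generator."""
    rng = random.Random(seed)
    draws = (sample_bernoulli_convolution(t, p, terms, rng) for _ in range(samples))
    return sum(draws) / samples
```

Every sample lies in $[-1/(1-t),\, 1/(1-t)]$, and under the stated sign convention the mean of the distribution is $(2p-1)/(1-t)$, which the empirical mean should approximate.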

Bernoulli convolutions have been studied extensively. It is known that, with $p = \tfrac{1}{2}$,
almost every $t \in [\tfrac{1}{2}, 1)$ leads to a measure which is absolutely continuous with respect
to Lebesgue (in particular, it has dimension 1), with similar results available for other
values of $p$ in a smaller range of $t$. See [28] and references therein for further background.

Theorem 1.8 can be inferred from a combination of existing results. In [30] it is
shown that $\dim_e(\nu_t^p)$ exists and is given by the supremum over a countable family of
continuous functions of $t$ and $p$, implying that $\dim_e(\nu_t^p)$ is lower semicontinuous in $t, p$.
On the other hand it is a rather deep fact that $\nu_t^p$ is exact-dimensional; see [9] for
a careful argument. Combining these with Proposition 3.4 we find that $\dim(\nu_t^p)$ is
lower-semicontinuous. Theorem 1.8 provides a direct argument for the semicontinuity
of Hausdorff dimension. We note that a simple ergodicity argument in the coding space
shows that $\dim_*(\nu_t^p) = \dim^*(\nu_t^p)$ for all $t, p$, so the result for lower Hausdorff dimension
implies it for the Hausdorff dimension.



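Proof of Theorem 1.8. Fix $(t_0, p_0) \in (0,1) \times (0,1)$ and $\varepsilon > 0$, and choose $N$ such that

$$\frac{H_{t_0^N}(\nu_{t_0}^{p_0})}{N \log(1/t_0)} > \dim_e(\nu_{t_0}^{p_0}) - \varepsilon.$$

Write $\Lambda = \{-1, 1\}^N$. Given $u = (u_0, \ldots, u_{N-1}) \in \Lambda$, let $P_t(u) = \sum_{j=0}^{N-1} u_j t^j$ and define
$\pi_t : \Lambda^{\mathbb{N}} \to \mathbb{R}$ by

$$\pi_t(x) = \sum_{i=0}^{\infty} P_t(x_i) \, t^{Ni}.$$

Let $\mu^p$ be the product measure on $\Lambda^{\mathbb{N}}$ whose marginal is

$$\mu^p([u]) = p^{|\{i \,:\, u_i = 1\}|} (1-p)^{|\{i \,:\, u_i = -1\}|} \quad \text{for } u \in \Lambda.$$

Then $\nu_t^p = \pi_t(\mu^p)$. It is not hard to check that $(t, p) \mapsto H_{t^N}(\nu_t^p)$ is continuous. Thus
there is a small square $Q = [t_0 - \delta, t_0 + \delta] \times [p_0 - \delta, p_0 + \delta]$ (with $\delta$ depending only on
$\varepsilon$ since $N = N(\varepsilon)$) such that, for $(t, p) \in Q$,

$$\gamma := N \log(1/t) \left(\dim_e(\nu_{t_0}^{p_0}) - 2\varepsilon\right) < H_{t^N}(\nu_t^p).$$

Fix $(t, p) \in Q$, write $\mu = \mu^p$ and $\pi = \pi_t$, and set $\rho = t^N$. To complete the proof, it is
enough to show that

$$\dim_*(\nu_t^p) \ge \dim_*(\nu_{t_0}^{p_0}) - 2\varepsilon - O(1/N).$$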
The implicit constant in $O(1/N)$ will depend on $t_0$, but since $N$ can be taken arbitrarily
large given $(p_0, t_0)$, this is of no consequence.

Note that, thinking of $X = \Lambda^{\mathbb{N}}$ as a $\rho$-tree, $\pi_t$ becomes Lipschitz and we can apply
Theorem 5.4 to obtain

$$X \xrightarrow{\;g\;} Y \xrightarrow{\;h\;} \mathbb{R}, \qquad \pi_t = h g.$$

Since $\mu$ is a product measure, we may identify $\mu_{[a]}$ and $\mu$ under the natural identification
of $[a]$ with the full tree. Also, up to a translation,

$$\pi\mu_{[a]} = S(\pi\mu),$$

where $S : \mathbb{R} \to \mathbb{R}$ is a homothety that scales by $t^{N|a|}$. Since translations do not change
entropies, we conclude

$$H_{\rho^{|a|+1}}(\pi\mu_{[a]}) \ge \gamma.$$

By Theorem 5.4,

$$H_{\rho^{|a|+1}}(g\mu_{[a]}) \ge \gamma - O(1).$$

This holds uniformly for all $a \in \Lambda^*$. Hence, using Theorem 4.5,

$$\dim_* g\mu \ge \frac{\gamma - O(1)}{N \log(1/t)}.$$

Finally, by Proposition 5.2,

$$\dim_*(\nu_t^p) \ge \dim_* g\mu - \frac{O(1)}{N \log(1/t)} \ge \frac{\gamma - O(1)}{N \log(1/t)} \ge \dim_e(\nu_{t_0}^{p_0}) - 2\varepsilon - O(1/N) \ge \dim_*(\nu_{t_0}^{p_0}) - 2\varepsilon - O(1/N),$$

where we used Proposition 3.4 in the last line. This completes the proof. □



7. CP-CHAINS AND LOCAL DYNAMICS 

In this section we formalize the notion of local dynamics of a measure along a fil- 
tration. We then introduce a slight generalization of Furstenberg's CP-chains, which 
provide a rich supply of measures with good local dynamics. 

We adopt the convention that a measure refers to a probability measure on Euclidean 
space or a tree. We use the term distribution for probability measures on larger spaces, 
such as the space of measures, sequence spaces over measures, etc. 

This section uses some basic notions from ergodic theory. A good introductory 
reference is [35]. 

7.1. Generic sequences. Let $M$ be a compact metric space (which, later on, we
usually do not specify) and denote by $T$ the shift map on $M^{\mathbb{N}}$. A sequence $\bar\mu =
(\mu_n)_{n=1}^{\infty} \in M^{\mathbb{N}}$ is generic if the sequence of distributions

$$A_N(\bar\mu, T) = \frac{1}{N} \sum_{n=0}^{N-1} \delta_{T^n \bar\mu}$$

converges in the weak-* topology, as $N \to \infty$, to some distribution $P \in \mathcal{P}(M^{\mathbb{N}})$ (note
that these distributions are the uniform measure on the initial $N$ points of the orbit
of $\bar\mu$ under $T$). Equivalently, there is a distribution $P \in \mathcal{P}(M^{\mathbb{N}})$ such that, for any
$f \in C(M^{\mathbb{N}})$,

$$\frac{1}{N} \sum_{n=0}^{N-1} f(T^n \bar\mu) \to \int f \, dP.$$

When it exists, the limit distribution $P$ is $T$-invariant. Similarly, $\bar\mu$ is totally generic if
for each $q \ge 1$, the corresponding average

$$A_N(\bar\mu, T^q) = \frac{1}{N} \sum_{n=0}^{N-1} \delta_{T^{qn} \bar\mu}$$

converges as $N \to \infty$ to a distribution $P_q$. In general, total genericity is stronger than
genericity.

An ergodic $T$-invariant distribution $P$ on $M^{\mathbb{N}}$ decomposes under $T^q$ into $q'$ ergodic
components for some $q' \mid q$, and these components average to $P$. When $\bar\mu$ is generic for
$P$ and the averages $A_N(\bar\mu, T^q)$ converge to a distribution $P_q$, then $P_q$ is invariant
under $T^q$, so it is a convex combination of these $q'$ components. Thus,

$$P = \frac{1}{q} \sum_{i=0}^{q-1} T^i P_q.$$
This implies the following lemma which we record for later use: 

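Lemma 7.1. Suppose $\bar\mu \in M^{\mathbb{N}}$ is generic for $P \in \mathcal{P}(M^{\mathbb{N}})$. Let $q \ge 1$ and suppose that
$A_N(\bar\mu, T^q) \to Q$. Let $f \in C(M^{\mathbb{N}})$. Then there is an $i \in \{0, \ldots, q-1\}$ such that

$$\lim_{N\to\infty} \frac{1}{N} \sum_{n=0}^{N-1} f(T^{qn+i} \bar\mu) \ge \int f \, dP.$$

Proof. Immediate from the fact that $P = \frac{1}{q} \sum_{i=0}^{q-1} T^i Q$. □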
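A toy example may help to see what the lemma says. The Python sketch below models a sequence as a function on the non-negative integers (an indexing convention of ours), takes the periodic sequence $0, 1, 0, 1, \ldots$, and computes orbit averages along $T$ and along $T^2$; all names are hypothetical.

```python
def shifted(seq, s):
    """The sequence T^s seq, with sequences modelled as functions n -> value (n = 0, 1, ...)."""
    return lambda n: seq(n + s)

def orbit_average(seq, f, q=1, i=0, N=1000):
    """(1/N) * sum_{n=0}^{N-1} f(T^{qn+i} seq)."""
    return sum(f(shifted(seq, q * n + i)) for n in range(N)) / N

periodic = lambda n: n % 2          # the sequence 0, 1, 0, 1, ...
first_coordinate = lambda s: s(0)   # an observable depending on one coordinate only
```

Here the average along $T$ is $1/2$, while the averages along $T^2$ started at $i = 0$ and $i = 1$ are $0$ and $1$ respectively. They average to $1/2$, and at least one of them (here $i = 1$) is at least the average along $T$, in the spirit of Lemma 7.1.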

7.2. Boxes and scaled measures. A $d$-dimensional box is a product of $d$ intervals
of positive length, each of which may be open, closed or half-open. The eccentricity of
a box is the ratio of the lengths of the longest and shortest side.

A box is normalized if its volume is 1 and its "lower left corner", i.e. the lexicographically
minimal point in its closure, is at the origin. For instance $[0,1]^d$ and $(0,1)^d$
are normalized. Every box $B$ can be scaled and translated in a unique way to give a
normalized box, which we denote $B^*$. We also define the linear operator $T_B$ by

$$T_B(x) = \frac{1}{\operatorname{vol}(B)^{1/d}} (x - \min B),$$

where $\min$ refers to the lexicographical ordering. Thus $B^* = T_B B$.
Recall that for a measure $\mu$ and box $B$ with $\mu(B) > 0$ we write

$$\mu_B = \frac{1}{\mu(B)} \mu|_B,$$

which is a probability measure supported on $B$. We also define

$$\mu^B = T_B \mu_B.$$

This is a probability measure supported on $B^*$. We call this the re-scaled version of
$\mu_B$.
7.3. Dynamics along filtrations. Suppose now that we are given a $\rho$-regular sequence
of refining partitions $\mathcal{F} = (\mathcal{F}_n)_{n=1}^{\infty}$ of a box $B$ into sub-boxes. For $\mu$ supported
on $B$ and $x \in \operatorname{supp}\mu$ we write $\mathcal{F}_n(x)$ for the element of $\mathcal{F}_n$ containing $x$. Define

$$\mu_{x,n} = \mu_{\mathcal{F}_n(x)}$$

and

$$\mu^{x,n} = \mu^{\mathcal{F}_n(x)}.$$

In this way, for each $\mu$ and $x$ we obtain sequences of measures $(\mu_{x,n})_{n=1}^{\infty}$ and $(\mu^{x,n})_{n=1}^{\infty}$.
The former sequence does not exhibit interesting dynamics, since the support of the
measures decreases to a point. However, elements of the latter sequence have been
re-scaled and the sequence potentially exhibits interesting dynamics. Because the filtration
is $\rho$-regular the eccentricities of $\mathcal{F}_n(x)$ are bounded, so there is a bounded
region of $\mathbb{R}^d$ supporting all the measures $\mu^{x,n}$. Thus all the $\mu^{x,n}$ belong to some weak-*
compact set in the space of measures.

Definition 7.2. Given a measure $\mu$ on $\mathbb{R}^d$ and a $\rho$-regular sequence $\mathcal{F} = (\mathcal{F}_n)_{n=1}^{\infty}$ of
partitions, $\mu$ generates a distribution $P \in \mathcal{P}(M^{\mathbb{N}})$ at $x$ if $(\mu^{x,n})_{n=1}^{\infty}$ is totally generic
for $P$. It generates $P$ along $\mathcal{F}$ if it generates $P$ at $\mu$-a.e. point.

We record for later use the following useful result:

Lemma 7.3. Suppose $\mu$ generates $P$ w.r.t. a partition $(\mathcal{F}_n)$. Let $E$ be a set with
$\mu(E) > 0$. Then $\nu = \mu_E$ generates $P$.

Proof. Write $\nu = \mu_E$. By the martingale theorem, for $\mu$-a.e. $x \in E$,

$$\lim_{n\to\infty} \frac{\mu(E \cap \mathcal{F}_n(x))}{\mu(\mathcal{F}_n(x))} = 1,$$

which implies that, for a.e. $x \in E$ the sequences $(\mu^{x,n})_{n=1}^{\infty}$ and $(\nu^{x,n})_{n=1}^{\infty}$ are weak-*
asymptotic as $n \to \infty$, so if one is generic for some distribution, both are. □



7.4. CP-chains. We next introduce a slightly generalized version of Furstenberg's CP- 
chains, which will supply us with measures and filtrations leading to generic sequences. 
Definition 7.4. Let $\mathcal{E}$ be a family of boxes. A partition operator $\Delta$ on $\mathcal{E}$ assigns to
each $E \in \mathcal{E}$ a partition $\Delta E = \{E_i\} \subseteq \mathcal{E}$ of $E$ in a translation and scale-invariant
manner, i.e. if $Sx = ax + b$ and $E, SE \in \mathcal{E}$ then $S(\Delta E) = \Delta(SE)$.

Define the iterates of $\Delta$ on $B \in \mathcal{E}$ by

$$\Delta^0(B) = \{B\},$$
$$\Delta^{n+1}(B) = \bigcup_{E \in \Delta^n(B)} \Delta(E).$$

Thus $\Delta^n(B)$ form a sequence of refining partitions of $B$.

Definition 7.5. A partition operator $\Delta$ on $\mathcal{E}$ is $\rho$-regular if for each $B \in \mathcal{E}$ the sequence
of partitions $(\Delta^n B)_{n=1}^{\infty}$ is $\rho$-regular.

For example, the base-$b$ partition operator is defined on the family $\mathcal{E}$ of all cubes $[u, v]^d$, $u < v$, by
$\Delta([0,1]^d) = \mathcal{D}_b$ (and extend by invariance to all cubes). Then $\Delta^n([0,1]^d) = \mathcal{D}_{b^n}$. This
operator is $1/b$-regular.

Definition 7.6. A CP-chain for a $\rho$-regular partition operator $\Delta$ on $\mathcal{E}$ is a stationary
Markov process $(\mu_n, B_n)_{n=1}^{\infty}$, where:

(1) The state space is the space of pairs $(\mu, B)$ in which $B \in \mathcal{E}$ is a box and $\mu$ is a
probability measure supported on $B^*$.

(2) The transition is given by the law

for $B \in \Delta(A^*)$, $(\mu, A) \mapsto (\mu^B, B)$ with probability $\mu(B)$.

We usually do not specify A (or £), and use this symbol generically for the partition 
operator associated to a CP-chain. 

The stationary process {fin)'^=i is called the measure component of the process. We 
shall not distinguish notationally between the distribution of the CP-chain, its measure 
component and its marginals. Thus if P is the distribution of a CP-chain we may write 
$(\mu, B) \sim P$, $\mu \sim P$ etc; the meaning should be clear from the context.

Furstenberg's CP-chains are recovered using the base-$b$ partition operator. We use
this partition operator everywhere except in the proof of Theorem 1.3, where a slightly
more elaborate partition operator is needed. We remark that one can introduce even
more general CP-chains by allowing the partition to depend also on the measure, i.e.
$B_{n+1} = \Delta(B_n, \mu_n)$, and also allow more general shapes than boxes; but we shall not
need this.

The following consequence of the ergodic theorem is immediate: 

Proposition 7.7. Let $(\mu_n, B_n)_{n=1}^{\infty}$ be an ergodic CP-chain with partition operator $\Delta$
and distribution $P$. Then for $P$-a.e. $(\mu, B)$, $\mu$ generates the measure component of $P$
w.r.t. the partitions $\mathcal{F}_n = \Delta^n(B^*)$, $n = 1, 2, 3, \ldots$.
Proof. Given a typical $(\mu_1, B_1)$, consider the conditional distribution on $(\mu_n, B_n)_{n=2}^{\infty}$
obtained by running the chain forward from $(\mu_1, B_1)$ using the transition law in the
definition. Note that the random set

~ • • • ^^2 1 

satisfies $A_n \in \Delta^n(B_1^*)$ and

Since $\operatorname{diam} A_n \to 0$ by regularity of $\Delta$, the intersection $\bigcap_{n=1}^{\infty} A_n$ consists almost surely
of a single random point $X \in B_1^*$. By definition

and furthermore the transition law has been so chosen that $X$ is distributed according
to $\mu_1$.

Now, by the ergodic theorem almost every realization $(\mu_n, B_n)_{n=1}^{\infty}$ is generic for the
CP-chain, and in particular almost every $(\mu_n)_{n=1}^{\infty}$ is generic for the measure component
of the process. Hence for almost every $\mu_1$ and almost every $(\mu_n)_{n=1}^{\infty}$ conditioned on
$\mu_1$ this is true. But by the above, given $\mu_1$ the conditional distribution on sequences
$(\mu_n)_{n=1}^{\infty}$ of measures is the same as the distribution of $((\mu_1)^{x,n})_{n=1}^{\infty}$ when $x$ is distributed
according to $\mu_1$, as desired.

Total genericity follows in the same way, because almost every point in an ergodic
system is totally generic. □

Corollary 7.8. Let $(\mu_n, B_n)_{n=1}^{\infty}$ be a CP-chain with partition operator $\Delta$. Let $P_{(\mu,B)}$
denote the ergodic component of $(\mu, B)$. Then for a.e. pair $(\mu, B)$, $\mu$ generates the
measure component of $P_{(\mu,B)}$, w.r.t. the filtration $\mathcal{F}_n = \Delta^n(B^*)$.

Proof. This follows from the previous proposition and the fact that the ergodic components
of a Markov chain are Markov chains for the same transition law. □

The next lemma is analogous to [13, Theorem 2.1] (and the remark following it). In
the examples we shall encounter one can either rely on that proposition, or else the
statement will be clear for other reasons, but we outline a proof for completeness.

Lemma 7.9. For an ergodic CP-chain $(\mu_n, B_n)_{n=1}^{\infty}$, a.e. measure $\mu_n$ is exact dimensional,
and $\dim \mu_n$ is almost surely constant.

Proof. It is easy to see that $\dim^* \mu_n$ is non-increasing in $n$, since $\mu_{n+1}$ is, up to scale
and translation, the restriction of $\mu_n$ to $B_{n+1}$. Since $\dim^*(\cdot)$ is a Borel function of the
measure, and the process is ergodic, $\dim^*(\cdot)$ must be almost everywhere constant. The
same argument works for $\dim_*$, which is non-decreasing in $n$.
To see that $\mu_1$ (and hence $\mu_n$) is exact-dimensional, condition the process on $(\mu_1, B_1)$,
and let $A_n$ be defined as in the proof of Proposition 7.7. Then

lim ^^^Sf^MN) ^ 1 log^i(^n) 



N-^oo logdiamAAT Af^oo A^logp log;Ui(j4„_i] 

N-l 



lim -— log/i„(B„+i). 

iV^oo iVlogp ^ 



n=0 

(in the first equality we used $\rho$-regularity of the partition operator). Writing $P$ for the
distribution of the process, the ergodic theorem implies that the above converges to



almost surely. Using the fact that $X = \bigcap A_n$ is distributed according to $\mu_1$, and using
regularity of $\Delta$ again, it follows from Lemma 3.2 that

$$\underline{\dim}(\mu_1, x) = \overline{\dim}(\mu_1, x) = \alpha$$

for $\mu_1$-a.e. $x$, which establishes the lemma. □

In light of the previous lemma, we refer to the dimension of a typical measure for a 
CP-chain as the dimension of the chain. 

7.5. Micromeasures and existence of CP-chains. The following discussion is adapted 
from |13| . 

Let $\mu$ be a measure on $\mathbb{R}^d$. A micromeasure of $\mu$ is any weak limit of measures
of the form $\mu^{Q_n}$, where the $Q_n$ are cubes of side length tending to 0. The set of
micromeasures of $\mu$ is denoted $\langle\mu\rangle$. Micromeasures are closely related to the tangent
measures of geometric measure theory.

Starting from a measure $\mu$ on $[0,1]^d$ and the $b$-adic partition operator, one can run
the chain forward. If one averages the distributions at times $1, 2, \ldots, n$ one gets a
sequence of distributions which in general will not converge, but one may still take
weak-* limits of it. These limiting distributions can be easily shown to be CP-chains
and are supported on the micromeasures of $\mu$. In this way we have associated to $\mu$
a family of CP-chains supported on $\langle\mu\rangle$, and from which one may hope to extract
information about $\mu$. One result in this direction is the following theorem, which
appears in another form in Furstenberg's paper [12]. Since it is not stated there in this
way, we indicate a proof.

Theorem 7.10. Let $\mu$ be a measure on $[0,1]^d$. Then there is an ergodic CP-chain of
dimension at least $\dim_e(\mu)$ supported on $\langle\mu\rangle$.



Proof. This is very similar to [13, Proposition 5.2]. Using the notation from that paper,
we indicate how to modify that proof to get the desired result: to form the measures 
(which correspond to our distributions on measures), use the given measure $\mu$ rather
than the measures On constructed in [13]. Moreover, consider only values of I along a 
sequence such that 

1 



H{fi,V i^) dime(/i), 
Ij logp P 

which we are allowed to do thanks to Lemma 3.3 above. Then proceed as in [13,
Proposition 5.2] to take a weak limit point of the distributions f£, which is clearly 
supported on $\langle\mu\rangle$. It follows that $\int H(\nu, \mathcal{D}_p) \, dP(\nu) \ge \dim_e(\mu)$. Hence there is an
ergodic component $Q$ of $P$ for which the same holds. $Q$ is still a CP-chain and is
supported on $\langle\mu\rangle$, and its dimension is equal to the average above (this follows from
[13, Theorem 2.1]), which is at least $\dim_e(\mu)$. □

8. Dimension of projections and CP-chains 

In this section we establish some continuity results for linear and smooth projections 
of typical measures for CP-chains. We show that the dimension of these projections is 
controlled, or at least bounded below, by mean projected entropies of the CP-chain. 

8.1. Linear projections. Fix an ergodic CP-chain $(\mu_n, B_n)_{n=1}^{\infty}$ with partition operator
$\Delta$ and distribution $P$. Fix $k$ and an orthogonal projection $\pi \in \Pi_{d,k}$. Given a
measure $\nu$ and $q \in \mathbb{N}$, let

$$e_q(\nu) = \frac{1}{q \log(1/\rho)} H_{\rho^q}(\pi\nu)$$

and denote the mean value of $e_q$ by

$$E_q = \int e_q(\nu) \, dP(\nu).$$



For the rest of the section, the constants implicit in the $O(\cdot)$ notation depend only
on $\rho$, the constant in the definition of $\rho$-regularity, $d$ and $k$. The following theorem
contains the proof of Theorem 1.9.

Theorem 8.1. Let $P$ be the distribution of an ergodic CP-chain, $\pi \in \Pi_{d,k}$ a projection,
and let $e_q$, $E_q$ be defined as above. If $\mu$ is a measure generating $P$ along a filtration
$\mathcal{F}_n = \Delta^n B^*$, then

(8.1) $$\dim_*(\pi\mu) \ge E_q - O(1/q).$$

In particular, this holds for $P$-a.e. $\mu$.

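Proof. First, suppose that the measure component of the process is totally ergodic.
Since $\mu$ generates the measure component of $P$, by Proposition 7.7, for $\mu$-a.e. $x$,

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} e_q(\mu^{x,qn}) = E_q.$$

Fix $q$. Using linearity of $\pi$, the $\rho$-regularity of $\mathcal{F}$ and Proposition 5.3 we see that for
every $n \in \mathbb{N}$,

$$|H_{\rho^q}(\pi\mu^{x,n}) - H_{\rho^{n+q}}(\pi\mu_{\mathcal{F}_n(x)})| = O(1).$$

Therefore for $\mu$-a.e. $x$,

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}\frac{1}{q\log(1/\rho)} H_{\rho^{q(n+1)}}(\pi\mu_{\mathcal{F}_{qn}(x)}) \ge E_q - O(1/q).$$

Let $X$ be the $\rho^q$-tree whose nodes at level $n$ are the atoms of $\mathcal{F}_{qn}$ with ancestry
determined by inclusion. Define $f : X \to \mathbb{R}^d$ by

$$\{f(E_1, E_2, \ldots)\} = \bigcap_{n=1}^{\infty}\overline{E_n}.$$

Then $f$ is Lipschitz by $\rho$-regularity of $\mathcal{F}$ (the Lipschitz constant depends also on the
constant $C$ in the definition of $\rho$-regularity). Let $\tilde\mu$ be the lift of $\mu$ to $X$ (see the footnote).

For the map $\tilde f = \pi f : X \to \mathbb{R}^k$ apply Theorem 5.4, obtaining a $\rho^q$-tree $Y$ and
maps $X \xrightarrow{g} Y \xrightarrow{h} \mathbb{R}^k$ as in the theorem. It follows that for each $n$-cylinder $E \subseteq X$,
corresponding to $E \in \mathcal{F}_{qn}$, we have

$$H_{\rho^{q(n+1)}}(\pi\mu_E) - H_{\rho^{q(n+1)}}(g\tilde\mu_E) = O(1).$$

Thus, for $g\tilde\mu$-a.e. $y \in Y$,

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1}\frac{1}{q\log(1/\rho)} H_{\rho^{q(n+1)}}\big((g\tilde\mu)_{[y_1\ldots y_n]}\big) \ge E_q - O(1/q).$$

By Theorem 4.5 this implies that

$$\dim_* g\tilde\mu \ge E_q - O(1/q),$$

and since $h$ is faithful,

$$\dim_* hg\tilde\mu \ge E_q - O(1/q).$$

As $hg\tilde\mu = \pi\mu$, we are done.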

Suppose now that the measure component of the process is not totally ergodic. For
$\mu$-almost every $x$, there is, by Lemma [7?Tl, an $i = i(x) \in \{0,\ldots,q-1\}$ (which may be
chosen measurably in $x$) such that

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} e_q(\mu^{x,qn+i}) \ge E_q.$$

Let $A_i$, $i = 0,\ldots,q-1$, be the partition according to $i(x)$. We may apply the argument above to
$T_B\mu_{A_i\cap B}$ for each $i$ and each $B \in \mathcal{F}_i$ separately, using the induced filtrations $T_B\mathcal{F}$ (see
Lemma 7.3). Since $\mu$ is a weighted average of the measures $\mu_{A_i\cap B}$, this completes the
proof. □

Footnote. If $\mu$ gives non-zero mass to the boundaries of partition elements the lift may not be unique. Fix for
example the lift defined by the condition that the cylinder set corresponding to $E \in \mathcal{F}_n$ has mass $\mu(E)$,
and we choose $\tilde\mu$ to be this measure. Alternatively, we may assume that the boundary of partition
elements is null by reducing, if necessary, to a lower-dimensional case as in [13].

We now let $\pi$ vary. Thus $e_q : \mathcal{M}\times\Pi_{d,k} \to [0,k]$ and $E_q : \Pi_{d,k} \to [0,k]$, and we write
$e_q(\nu,\pi)$ and $E_q(\pi)$ to make the dependence on $\pi$ explicit. Note that the next theorem
implies Theorem 1.10.

Theorem 8.2. Fix an ergodic CP-chain of dimension $\alpha$ (recall Lemma 7.9) with
distribution $P$. Define $e_q$ and $E_q$ as above. The limit

$$E(\pi) := \lim_{q\to\infty} E_q(\pi)$$

exists and $E : \Pi_{d,k} \to [0,k]$ is lower semi-continuous. Moreover,

(1) $E(\pi) = \min(k,\alpha)$ for almost every $\pi$.

(2) For a fixed $\pi \in \Pi_{d,k}$,

$$\dim_e(\pi\mu) = \dim_*\pi\mu = E(\pi) \quad\text{for } P\text{-a.e. } \mu.$$

(3) For any measure $\mu$ that generates (the measure component of) $P$ along a
filtration $\{\Delta^n B^*\}$,

$$\underline{\dim}_e(\pi\mu) \ge \dim_*\pi\mu \ge E(\pi) \quad\text{for all } \pi \in \Pi_{d,k}.$$

In particular, the above holds on a set $M$ with $P(M) = 1$.

Proof. We first establish convergence of $E_q$ and claim (2). It follows from Theorem 8.1
that if $\mu$ generates $P$, then we obtain

(8.2) $\dim_*(\pi\mu) \ge \limsup_{n\to\infty} E_n(\pi).$

On the other hand, by definition of entropy dimension, we have

$$\underline{\dim}_e(\pi\mu) = \liminf_{n\to\infty} e_n(\mu,\pi).$$

Integrating, we have by Fatou that

(8.3) $\int \underline{\dim}_e(\pi\nu)\,dP(\nu) \le \liminf_{n\to\infty} E_n(\pi).$

Since $\dim_*\pi\mu \le \underline{\dim}_e\pi\mu$ holds for any measure by equation (3.1) in Proposition 3.4,
combining (8.2) and (8.3) we see that $E_n(\pi)$ converges and the limit is the common
value of $\dim_*\pi\mu = \dim_e\pi\mu$ for almost every $\mu$ (possibly depending on $\pi$).

To prove (1), write $\beta = \min\{k,\alpha\}$ for the expected dimension of the image measure.
For almost every $\mu$, by Theorem 3.7 we have $\dim\pi\mu = \beta$ for a.e. $\pi$. By Fubini, for a.e. $\pi$ this
holds for a.e. $\mu$, hence $E(\pi) = \beta$ for a.e. $\pi$ (we remark that entropy dimension of a
measure is a Borel function of the measure).

We next establish semicontinuity of $E$. Given $\pi \in \Pi_{d,k}$ and $\varepsilon > 0$ there is a $q$ so
that $E_q(\pi) + O(1/q) > E(\pi) - \varepsilon$, where $O(1/q)$ is the error term in Theorem 8.1. Since
$E_q$ is continuous, this continues to hold in a neighborhood $U$ of $\pi$. By Theorem 8.1,
for almost every $\mu$, if $\pi' \in U$ then by letting $q \to \infty$ we get $\dim_*\pi'\mu > E(\pi) - \varepsilon$. This
implies that $E(\pi') > E(\pi) - \varepsilon$ for $\pi' \in U$, and since $\varepsilon$ was arbitrary, semicontinuity
follows.

The last statement follows from (8.2). □

We shall also encounter non-ergodic CP-chains. In this case much of the above fails;
for example, the dimension of the projection through $\pi$ of a typical measure need not be
a.s. constant, as it may differ by ergodic component (nor do measures for the process
have to have the same dimension a.s.). However, we have the following substitute:

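Theorem 8.3. Let $P$ be the distribution of a CP-chain whose measures almost surely have
exact dimension $\alpha$, and write $\beta = \min\{k,\alpha\}$. Then for any $\varepsilon > 0$ there is an open dense set
$\mathcal{U} \subseteq \Pi_{d,k}$ and a set $\mathcal{M}_\varepsilon$ of measures with $P(\mathcal{M}_\varepsilon) > 1 - \varepsilon$ and such that

$$\dim_*\pi\mu > \beta - \varepsilon$$

for every $\pi \in \mathcal{U}$ and $\mu \in \mathcal{M}_\varepsilon$.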

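Proof. Applying Theorem 3.7 and Fubini, we can find a dense set of projections
$\{\pi_i\}_{i=1}^{\infty} \subseteq \Pi_{d,k}$ such that for a.e. $\mu$,

$$\dim_*\pi_i\mu = \beta \quad\text{for } i \in \mathbb{N}.$$

This together with equation (3.1) imply that for almost every $\mu$,

$$\liminf_{q\to\infty} e_q(\mu,\pi_i) \ge \beta$$

for every $i$, and we can therefore find a set $\mathcal{M}_\varepsilon$ of measures with $P(\mathcal{M}_\varepsilon) > 1 - \varepsilon$ such
that for each $i$, the convergence above holds uniformly in $\mu \in \mathcal{M}_\varepsilon$. From this and the
proof of Theorem 8.2, for each $i$ there is an open neighborhood $\mathcal{U}_i \subseteq \Pi_{d,k}$ of $\pi_i$ such
that

$$\dim_*\pi'\mu > \beta - \varepsilon \quad\text{for } \pi' \in \mathcal{U}_i,\ \mu \in \mathcal{M}_\varepsilon.$$

Now $\mathcal{U} = \bigcup_{i=1}^{\infty}\mathcal{U}_i$ is the open dense set we are looking for. □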

8.2. Non-linear images of measures. Fix an ergodic CP-chain, let $\mu$ be a typical
measure for it, and define $e_n$, $E_n$ etc. as in the previous section. In the proof of Theorem
8.1 linearity of $\pi$ was used only for the bound

$$|H_{\rho^q}(\pi\mu^{x,n}) - H_{\rho^{n+q}}(\pi\mu_{\mathcal{F}_n(x)})| \le O(1).$$

Now replace $\pi$ with a differentiable but non-linear map $f$. For $n$ large enough, the
restriction of $f$ to $\mathcal{F}_n(x)$ approaches the linear map $D_xf$, and hence, after a little work,
the bound above can be replaced by

$$|H_{\rho^q}((D_xf)\mu^{x,n}) - H_{\rho^{n+q}}(f\mu_{\mathcal{F}_n(x)})| \le O(1).$$

We have omitted a few details here but this is essentially how the following proposition
is proved.

Proposition 8.4. Fix an ergodic CP-chain with distribution $P$, a projection $\pi \in \Pi_{d,k}$,
and define $e_q$ and $E_q$ as before. Then for all maps $f : [0,1]^d \to \mathbb{R}^k$ such that
$\sup_{x\in\operatorname{supp}\mu}\|D_xf - \pi\| < \rho^q$, we have

$$\dim_*(f\mu) \ge E_q - O(1/q)$$

for all $\mu$ that generate the measure component of $P$ along a sequence of partitions
$\mathcal{F}_n = \Delta^n(B^*)$ (and in particular for $P$-a.e. $\mu$).

Proof. The proof is completely analogous to that of Theorem 8.1 and we only
indicate the differences.

We construct $X$ from $\mathcal{F}$ and the map $X \to \mathbb{R}^d$ precisely as before (in the proof of
Theorem 8.1 this map was called $f$), lift $\mu$ to $\tilde\mu$ and let $\tilde f : X \to \mathbb{R}^k$ be the composition
of $f$ with this map. Construct $X \xrightarrow{g} Y \xrightarrow{h} \mathbb{R}^k$ as before. We wish to estimate the
dimension of $g\tilde\mu$ and for this we must estimate

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} H_{\rho^{q(n+1)}}(g\tilde\mu_{[x_1\ldots x_n]})$$

for $\tilde\mu$-typical $x \in X$. Briefly, the point is that as $n \to \infty$ the map $g$ at $[x_1\ldots x_n]$
looks more and more like $f$ on $\mathcal{F}_{qn}(x)$, which looks more and more like $D_xf$, which is
not far from $\pi$, so we are essentially averaging $H_{\rho^{q(n+1)}}(\pi\mu_{\mathcal{F}_{qn}(x)})$. By $\rho$-regularity of
the filtration, this is almost the same as the average of $H_{\rho^q}(\pi T_{\mathcal{F}_{qn}(x)}\mu_{\mathcal{F}_{qn}(x)})$, which is
$H_{\rho^q}(\pi\mu^{x,qn})$, and we get our bound by total genericity.

Here are the details. Losing an $O(1)$ term, by Theorem 4.3 it suffices to estimate

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} H_{\rho^{q(n+1)}}(f\mu_{x,qn})$$

for $\mu$-typical $x$. Now, we change scale: letting $A_n$ denote scaling by $\rho^{-qn}$ on $\mathbb{R}^k$,

$$H_{\rho^{q(n+1)}}(f\mu_{x,qn}) = H_{\rho^q}(A_n(f\mu_{x,qn})).$$

Since $f$ is differentiable at $x$, we see that

$$\lim_{n\to\infty}\big(A_n(f\mu_{x,qn}) - A_n\circ(D_xf)(\mu_{x,qn})\big) = 0,$$

and since $A_n\circ D_xf = D_xf\circ A_n'$, where $A_n'$ is scaling by $\rho^{-qn}$ on $\mathbb{R}^d$, we have

$$\lim_{n\to\infty}\big(A_n(f\mu_{x,qn}) - (D_xf)\circ A_n'(\mu_{x,qn})\big) = 0.$$

The same is true after we apply $H_{\rho^q}$ to these measures, so it suffices to estimate

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} H_{\rho^q}\big((D_xf)\circ A_n'(\mu_{x,qn})\big).$$

Since $\|D_xf - \pi\| < \rho^q$ we have by Lemma 3.6 that

$$|H_{\rho^q}(\pi\nu) - H_{\rho^q}((D_xf)\nu)| = O(1)$$

for any $\nu$, in particular for $\nu = A_n'(\mu_{x,qn})$; thus we only need to estimate

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} H_{\rho^q}\big(\pi\circ A_n'(\mu_{x,qn})\big).$$

Finally, since $H_{\rho^q}(\cdot)$ is invariant under translations and $\mathcal{F}$ is $\rho$-regular, by Lemma 3.3 we
may replace $A_n'$ by $T_{\mathcal{F}_{qn}(x)}$ at the cost of losing another $O(1)$. Since $T_{\mathcal{F}_{qn}(x)}\mu_{x,qn} = \mu^{x,qn}$
we have reduced the problem to estimating the ergodic averages

$$\liminf_{N\to\infty}\frac{1}{N}\sum_{n=0}^{N-1} H_{\rho^q}(\pi\mu^{x,qn}),$$

which, by total ergodicity of $P$, equal

$$\int H_{\rho^q}(\pi\nu)\,dP(\nu) = q\log(1/\rho)\cdot E_q(\pi),$$

as desired. □

Proof of Theorem \1.1^ The theorem is an immediate consequence of Theorem I8.2l ([2l) 
and Proposition I8.4[ □ 

We also have the following strengthening of Corollary 1.11.

Corollary 8.5. For every $\varepsilon > 0$ there is an open set $\mathcal{U}_\varepsilon \subseteq C^1([0,1]^d, \mathbb{R}^k)$ with the
following properties:

(1) $\mathcal{U}_\varepsilon \cap \Pi_{d,k}$ is open, dense and has full measure in $\Pi_{d,k}$.

(2) If $\mu$ generates the measure component of an ergodic CP-chain along a filtration
$\{\Delta^n(B^*)\}$, then

$$\dim_*\pi\mu > \min(k,\alpha) - \varepsilon \quad\text{for all } \pi \in \mathcal{U}_\varepsilon,$$

where $\alpha$ is the dimension of the CP-chain.

Proof. This is an immediate consequence of Theorem 8.2 (1) and Proposition 8.4. □



Finally, we obtain Theorem 1.13 as another consequence of Proposition 8.4.

Proof of Theorem 1.13. Fix a measure $\mu$ that generates the measure component of the
CP-chain along a filtration $\{\Delta^n(B^*)\}$ and a $C^1$ map $g : \operatorname{supp}\mu \to \mathbb{R}^k$ without singular
points. It follows from Proposition 8.4 that, given a $q \in \mathbb{N}$, for every $x \in \operatorname{supp}(\mu)$ there
is $r = r(x)$ such that

$$\dim_*(g\mu_{B_r(x)}) \ge E_q(D_xg) - O(1/q).$$

It easily follows that

$$\dim_* g\mu \ge \operatorname{essinf}_{x\sim\mu} E_q(D_xg) - O(1/q),$$

and we obtain the theorem by letting $q \to \infty$. □

9. Self-similar measures

9.1. Self-similar sets and measures. In this section we prove Theorem 1.6. We
begin by briefly recalling the main definitions involved.

A map $f$ on $\mathbb{R}^d$ is called a contraction if it is $C$-Lipschitz for some $C < 1$. Let $\Lambda$ be
a finite index set; a collection $\{f_i : i \in \Lambda\}$ of contractions on $\mathbb{R}^d$ is called an iterated
function system or IFS for short. As is well-known, there is a unique nonempty compact
set $X$, called the attractor of the IFS, such that $X = \bigcup_{i\in\Lambda} f_i(X)$. For $a = a_1\ldots a_n \in \Lambda^n$,
write

$$f_a = f_{a_1}\circ f_{a_2}\circ\cdots\circ f_{a_n}.$$

Given $a \in \Lambda^{\mathbb{N}}$ and $x \in \mathbb{R}^d$, the sequence $f_{a_1\ldots a_n}(x)$ has a limit which lies in $X$ and is
independent of $x$. This defines a continuous and surjective map $\Phi : \Lambda^{\mathbb{N}} \to X$, called
the coding map.
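The definitions of $f_a$ and of the coding map can be illustrated with a small Python sketch. The helper names `compose_along` and `coding_map`, and the choice of the two similarities generating the middle-thirds Cantor set, are assumptions made for the example; the coding map is only approximated by truncating the symbolic sequence to a finite prefix.

```python
def compose_along(word, maps, x):
    """Apply f_{a_1} o f_{a_2} o ... o f_{a_n} to x (the innermost map is f_{a_n})."""
    for letter in reversed(word):
        x = maps[letter](x)
    return x

def coding_map(sequence_prefix, maps, x0=0.0):
    """Approximate Phi(a) by f_{a_1...a_n}(x0) for a finite prefix of a."""
    return compose_along(sequence_prefix, maps, x0)

# Hypothetical IFS: the two similarities generating the middle-thirds Cantor set.
cantor = {0: lambda x: x / 3.0, 1: lambda x: x / 3.0 + 2.0 / 3.0}

prefix = [0, 1] * 20  # first 40 symbols of the periodic sequence 0101...
print(coding_map(prefix, cantor, x0=0.0))  # close to 1/4
print(coding_map(prefix, cantor, x0=5.0))  # nearly the same: the limit ignores x0
```

The two printed values agree up to an error of order $3^{-40}$, which reflects the independence of the limit from the starting point $x$.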

We say that the strong separation condition holds for $\{f_i : i \in \Lambda\}$ if the sets $f_i(X)$
are pairwise disjoint. This implies that the coding map is injective on the attractor.

Given an iterated function system $\{f_i : i \in \Lambda\}$ and a probability vector $(p_i)_{i\in\Lambda}$, one
can form the product measure on $\Lambda^{\mathbb{N}}$ with marginal $\{p_i : i \in \Lambda\}$. The push-forward $\mu$ of
this measure under the coding map is the unique probability measure on $\mathbb{R}^d$ satisfying

$$\mu = \sum_{i\in\Lambda} p_i\, f_i\mu.$$

The collection of pairs $\{(f_i,p_i)\}_{i\in\Lambda}$ is called a weighted iterated function system.

When all the maps fi of an IFS are contracting similarities, one says that X is a 
self-similar set, and a measure as above is a self-similar measure. Under the strong 
separation condition self-similar measures are quite well understood; in particular, they 
are exact dimensional. In general, however, projections of self-similar sets or measures 
are not self-similar, and may have complicated overlaps even if the original set does 
not. 

9.2. Proof of Theorem 1.6. Let $\{f_i : i \in \Lambda\}$ be an iterated function system satisfying
the hypotheses of Theorem 1.6. As with Theorem 1.3, the proof of Theorem 1.6
has two parts: first we establish a continuity result (or, rather, a topological statement
about the set of nearly-good projections), and then use invariance of (pieces of) the
self-similar measure under a sufficiently rich set of orthogonal maps to conclude that
all linear projections are good.

We shall rely on the existence of a CP-chain supported on measures closely related 
to the self-similar measure from which we begin. 

Proposition 9.1. Let $\mu$ be a self-similar measure for an IFS satisfying the strong
separation property. Then there is a CP-chain supported on measures $\nu$ such that, for
some similarities $S, S'$ and Borel sets $B, B'$ (depending on $\nu$), we have $\mu = S\nu_B$ and
$\nu = S'\mu_{B'}$.

There are a number of ways to construct such a CP-chain. One is to rely on Theorem
7.10, but then one must work a bit to show that micromeasures on which the chain
is supported have the desired property; not all micromeasures do, since, for example,
one can always obtain micromeasures which give positive mass to some affine subspace,
even when the original measure gives zero mass to all subspaces. This approach for the
homothetic case is discussed in [H]. The general case is proved in [16].

Given a measure $\mu$ on $\mathbb{R}^d$ and $\alpha > 0$, let

$$\mathcal{U}_\alpha(\mu) = \{\pi \in \Pi_{d,k} : \dim_*\pi\mu > \alpha\}.$$

The following observation is immediate from the definition of lower dimension:

Lemma 9.2. Let $\mu, \nu$ be two measures on $\mathbb{R}^d$, and suppose that $S\mu = \nu_Q$ for some
similarity $S$ and some set $Q$ with $\nu(Q) > 0$. Let $O$ be the orthogonal part of $S$. Then
for any $\alpha$,

$$\mathcal{U}_\alpha(\mu) \supseteq \{\pi O : \pi \in \mathcal{U}_\alpha(\nu)\}.$$

With this machinery in place, we have:

Proposition 9.3. Let $\mu$ be a self-similar measure for an IFS satisfying the strong
separation condition. Then for every $\varepsilon > 0$ there exists an open and dense set $\mathcal{U} \subseteq \Pi_{d,k}$
such that

$$\dim_*(\pi\mu) > \min(k,\dim\mu) - \varepsilon \quad\text{for all } \pi \in \mathcal{U}.$$

Proof. Choose an ergodic CP-chain supported on measures which, up to similarity,
contain a copy of $\mu$ as a restriction. Choose $\varepsilon > 0$. By Corollary 1.11 there is an open
dense set of projections $\mathcal{U}' \subseteq \Pi_{d,k}$ so that for a.e. measure $\nu$ for the chain, the image
under any $\pi \in \mathcal{U}'$ has dimension at least $\min(k,\dim\mu) - \varepsilon$. Choosing a typical measure $\nu$
and applying the previous lemma, we see that

$$\mathcal{U}_{\alpha-\varepsilon}(\mu) \supseteq \{\pi O : \pi \in \mathcal{U}'\},$$

for some orthogonal map $O$, where $\alpha = \min(k,\dim\mu)$. This completes the proof. □

We first establish Theorem 1.6 in the case of linear maps:

Proposition 9.4. For every $\pi \in \Pi_{d,k}$,

$$\dim(\pi\mu) = \min(k,\dim\mu).$$

Proof. Fix $\varepsilon > 0$. By Corollary 3.9 it suffices to show that for every $\pi \in \Pi_{d,k}$,

$$\dim_*(\pi\mu) > \min(k,\dim\mu) - \varepsilon.$$

Let $\mathcal{U}_\varepsilon \subseteq \Pi_{d,k}$ denote the set of projections with this property and let $\pi \in \mathcal{U}_\varepsilon$. For
$a \in \Lambda$ and $B = f_a(X)$, where $X$ is the attractor, the same inequality holds with $\pi\mu_B$
in place of $\pi\mu$. Since $\mu_B = f_a(\mu)$ by strong separation, we conclude that

$$\dim_*\pi\circ f_a(\mu) > \min(k,\dim\mu) - \varepsilon.$$

But clearly $\pi\circ f_a\mu$ has the same dimension as $\pi\circ O_a\mu$, where $O_a$ is the orthogonal
part of $f_a$, so $\pi\circ O_a \in \mathcal{U}_\varepsilon$. Hence $\mathcal{U}_\varepsilon$ is invariant under the (semi)group action on $\Pi_{d,k}$
generated by pre-composition with $\{O_a : a \in \Lambda\}$. By Proposition 9.3 the set $\mathcal{U}_\varepsilon \subseteq \Pi_{d,k}$
has non-empty interior, and by assumption the action in question is minimal; therefore

$$\mathcal{U}_\varepsilon = \Pi_{d,k}.$$ □

Finally, to prove the assertion of Theorem 1.6 about non-linear images of $\mu$, we rely
on Theorem 1.13. Let $P$ be the distribution of the CP-chain found in Proposition 9.1.
It suffices to show that the function $E(\pi)$ associated to $P$ in Theorem 1.10 is equal, for
every $\pi$, to the expected dimension, i.e. $\alpha = \min\{k,\dim\mu\}$. For this we need only note
that, for a $P$-typical measure $\nu$ we have $\nu = S\mu_B$ for some Borel set $B$ and similarity
$S$. Since $\dim_*\pi\mu = \alpha$ for every $\pi \in \Pi_{d,k}$, the same holds for $\nu$. Thus the same also
holds for entropy dimension $\dim_e\pi\nu$, $\pi \in \Pi_{d,k}$. But since $\nu$ was an arbitrary $P$-typical
measure, we see from Theorem 8.2 that $E(\pi) = \alpha$ for every $\pi \in \Pi_{d,k}$, as desired.

10. Furstenberg's conjecture and measures invariant under $\times m$

In this section we prove Theorem 1.3: if $m, n$ are not powers of a common integer
and $\mu, \nu$ are respectively $T_m$ and $T_n$-invariant measures on $[0,1]$, with $T_kx = kx \bmod 1$,
then $\mu\times\nu$ projects to a measure of the largest possible dimension for any projection
other than the coordinate projections.

The proof has two parts. The first is to associate to $\mu\times\nu$ a CP-chain and derive
topological information about the set of projections which have the desired property.
The second part of the proof uses irrationality of $\log m/\log n$ to boost this information
to the desired result using a certain invariance of the set of "approximately good"
projections.

10.1. Invariant measures and CP-chains. We first demonstrate how a $T_m$-invariant
measure gives rise to a CP-chain. We do not use this directly here, but it serves to
explain the construction in the next section.

Lift $[0,1]$ to $X^+ = \{0,\ldots,m-1\}^{\mathbb{N}}$ via base-$m$ coding and denote by $T$ the shift
map, which is conjugated to $T_m$. Let $\mu$ be a non-atomic $T_m$-invariant measure on $[0,1]$,
which we identify with its lift to $X^+$. Let $\mu$ also denote the shift-invariant measure on

$$X = \{0,1,\ldots,m-1\}^{\mathbb{Z}},$$

obtained as the natural extension of $(X^+,\mu,T)$; the shift on $X$ is also denoted $T$. Let
$(X_n)_{n=-\infty}^{\infty}$ be the coordinate functions

$$X_n(x) = x_n, \quad x \in X.$$

Disintegrate $\mu$ with respect to the $\sigma$-algebra $\mathcal{F}^- = \sigma(X_i : i \le 0)$. For $\mu$-a.e. $x \in X$
we obtain the measure $\mu^x$ on $X^+$, depending only on $x^- = (\ldots, x_{-2}, x_{-1}, x_0)$, such that
for any $A \subseteq X^+$ we have $\mu(A) = \int \mu^x(A)\,d\mu(x)$. For $\mu$-typical $x \in X$ construct the
sequence

$$(\mu_n, B_n) = \Big(\mu^{T^nx}, \Big[\frac{x_{n+1}}{m}, \frac{x_{n+1}+1}{m}\Big)\Big) \in \mathcal{P}([0,1])\times\mathcal{D}_m.$$

Pushing the measure $\mu$ forward via $x \mapsto ((\mu_n, B_n))_{n=1}^{\infty}$ we obtain a stationary
$\mathcal{P}([0,1])\times\mathcal{D}_m$-valued process which is seen to be a CP-chain for the base-$m$ partition
operator. This is analogous to the second example in [13, page 409].

We record the following well-known fact: If $\mu$ is an ergodic $T_m$-invariant measure,
then both $\mu$ and $\mu$-a.e. $\mu^x$ are exact dimensional, with

(10.1) $\dim\mu = \dim\mu^x = h(\mu,T_m)/\log m.$

Here $h$ is the Kolmogorov-Sinai entropy (for $\mu$ this follows from Shannon-McMillan-Breiman,
for $\mu^x$ one uses e.g. Lemma 4.2). When $\mu$ is not ergodic and $\mu = \int\mu_\omega\,d\sigma(\omega)$ is
its ergodic decomposition, then $\mu$ is exact dimensional if and only if almost all ergodic
components have the same dimension (i.e. the same entropy). More generally, one can
show that [m Theorem 9.1]

$$h(\mu,T_m) = \int h(\mu_\omega,T_m)\,d\sigma(\omega)$$

and

(10.2) $\dim_*\mu = \operatorname{essinf}_{\omega\sim\sigma}\dim_*\mu_\omega.$
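Formula (10.1) can be made concrete in the simplest case. The sketch below assumes, purely for illustration, that the invariant measure is a Bernoulli measure, i.e. the base-$m$ digits are independent with a common distribution $p$; for such a measure the Kolmogorov-Sinai entropy is the Shannon entropy of $p$. The function name `bernoulli_dimension` is hypothetical.

```python
import math

def bernoulli_dimension(p, m):
    """Dimension h / log m of the measure with i.i.d. base-m digits of law p,
    following formula (10.1) with h the Shannon entropy of p."""
    if len(p) != m or abs(sum(p) - 1.0) > 1e-12:
        raise ValueError("p must be a probability vector of length m")
    entropy = -sum(pi * math.log(pi) for pi in p if pi > 0)
    return entropy / math.log(m)

# Digits 0 and 2 in base 3 with equal weights: the natural measure on the
# middle-thirds Cantor set, of dimension log 2 / log 3.
print(bernoulli_dimension([0.5, 0.0, 0.5], 3))

# Uniform binary digits: Lebesgue measure, of dimension 1.
print(bernoulli_dimension([0.5, 0.5], 2))
```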

10.2. Products of $\times m$, $\times n$ invariant measures. We now deal with the more delicate
case of a measure

$$\theta = \mu\times\nu$$

where $\mu, \nu$ are measures on $[0,1)$ invariant, respectively, under $T_m, T_n$. When $\log m/\log n \in \mathbb{Q}$
the product measure is invariant under the action $T_k\times T_k$ for some $k \in \mathbb{N}$ which is a
common power of $m, n$, and we get a CP-chain in much the same manner as before. In
the case that $\log m/\log n \notin \mathbb{Q}$ the product action, and also all actions of products of powers
of $T_m, T_n$, are not local homotheties, and the iterates of the product partitions do not
have bounded eccentricity. Instead, we shall show that this measure is associated to a
CP-chain using a more involved partition operator and some additional randomization.

For concreteness we fix $m = 2$ and $n = 3$, and assume as we may that $\mu, \nu$ are
non-atomic. As before, identify $\mu, \nu$ with shift-invariant measures on $X^+ = \{0,1\}^{\mathbb{N}}$ and
$Y^+ = \{0,1,2\}^{\mathbb{N}}$ and extend to the two sided versions on the corresponding two-sided
subshifts $X, Y$. Let $(X_n)_{n=-\infty}^{\infty}$, $(Y_n)_{n=-\infty}^{\infty}$ denote the coordinate functions, and let
$\mu^x$, $x \in X$ and $\nu^y$, $y \in Y$ denote the disintegrations with respect to $\mathcal{F}^- = \sigma(X_n : n \le 0)$
and $\mathcal{G}^- = \sigma(Y_n : n \le 0)$, respectively.

To construct our CP-chain we first describe our partition operator. For $w \in [0,\log 3)$
define the rectangle

$$R_w = [0,1]\times[0,e^w]$$

and let $\mathcal{E}$ be all rectangles similar to some $R_w$, $w \in [0,\log 3)$. We shall define the
partition operator $\Delta$ on the sets $R_w$ and extend by similarity to the rest of $\mathcal{E}$.

[Figure 10.1: two panels, labelled $0 < e^w < 3/2$ and $3/2 < e^w < 3$.]

Figure 10.1. The partition operator $\Delta$. If the eccentricity of $R$ is
$e^w$, then for the rectangles in $\Delta(R)$ the eccentricity is $e^{w'}$ where $w' =
w + \log 2 \bmod \log 3$.

To apply $\Delta$ to $R_w$, first split $R_w$ into $R' = [0,\frac12]\times[0,e^w]$ and $R'' = (\frac12,1]\times[0,e^w]$.
Then, if $w \ge \log(3/2)$, split $R'$ into three sub-rectangles with the same base and heights
$\frac13 e^w = e^{w-\log 3}$; similarly $R''$. The partition obtained is $\Delta(R_w)$; see Figure 10.1. It
is a partition of $R_w$ into either two or six copies of $R_{w'}$, where

(10.3) $w' = w + \log 2 - 1_{\{w \ge \log 3-\log 2\}}\cdot\log 3 \in [0,\log 3).$

Thus, applying $\Delta$ repeatedly to a rectangle $R_{w_1}$ results in partitions into rectangles
similar to $R_{w_n}$, $n = 1,2,3,\ldots$ and the sequence $(w_n)_{n=1}^{\infty}$ evolves according to an
irrational rotation by $\log 2/\log 3$.
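The evolution (10.3) of the eccentricity parameter is easy to simulate. The following Python sketch is only an illustration: the names `partition_step` and `eccentricity_orbit` are hypothetical, the boundary case is resolved with the convention $w \ge \log 3 - \log 2$ used in (10.3), and the clamp at 0 is an added guard against floating-point rounding at that boundary.

```python
import math

LOG2, LOG3 = math.log(2.0), math.log(3.0)

def partition_step(w):
    """One application of the partition operator to R_w = [0,1] x [0,e^w].
    Returns (w_next, number_of_pieces) as in (10.3)."""
    if not 0.0 <= w < LOG3:
        raise ValueError("w must lie in [0, log 3)")
    if w >= LOG3 - LOG2:
        # halve the base, then cut the height in three: six pieces
        return max(0.0, w + LOG2 - LOG3), 6
    # only halve the base: two pieces
    return w + LOG2, 2

def eccentricity_orbit(w, steps):
    """The sequence w_1, w_2, ... obtained by iterating the operator."""
    orbit = [w]
    for _ in range(steps):
        w, _pieces = partition_step(w)
        orbit.append(w)
    return orbit

print(partition_step(0.0))                 # two pieces, w' = log 2
print(partition_step(LOG2))                # six pieces, w' = log(4/3)
print(eccentricity_orbit(0.0, 5))
```

Dividing the orbit by $\log 3$ gives the orbit of the rotation of the circle by $\log 2/\log 3$, which is the irrational rotation mentioned above.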

Lemma 10.1. For any $w \in [0,\log 3)$, the filtration $\mathcal{F}_n = \Delta^nR_w$ is $\frac12$-regular.

Proof. Immediate, since the bases of the rectangles in $\Delta^nR_w$ are of length $(\frac12)^n$ and by
definition their eccentricity is uniformly bounded. □



Let

$$S_w = \begin{pmatrix} 1 & 0 \\ 0 & e^w \end{pmatrix},$$

so $S_w([0,1]^2) = R_w$ and the axes are eigen-directions. Choose $(x,y,w) \in X\times Y\times
[0,\log 3)$ according to $\mu\times\nu\times\lambda$, where $\lambda$ is Lebesgue measure, and associate to it the
measure

$$\tau = \tau_{x,y,w} = S_w(\mu^x\times\nu^y).$$

The distribution of $\tau$ is the marginal of the measure component of a CP-chain. To get
the distribution on pairs which is the marginal of the full CP-chain, choose $B' \in \Delta(R_w)$
with weights $\tau(B')$ and set $\tau' = \tau^{B'}$; then $(\tau', B')$ is the desired marginal (of course,
$\tau'$ has the same distribution as $\tau$). One verifies this by checking that the distribution
is fixed by the transition law with respect to $\Delta$ (see the footnote).

Note that, if $\mu$ and $\nu$ are ergodic, then $S_w(\mu^x\times\nu^y)$ almost surely has the same
dimension as $\theta = \mu\times\nu$.

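Proposition 10.2. Suppose $\mu$ and $\nu$ are ergodic. For every $\varepsilon > 0$ there is a subset
$A_\varepsilon \subseteq [0,1]^2$ with $\theta(A_\varepsilon) > 1 - \varepsilon$ and a dense, open set $\mathcal{U}_\varepsilon \subseteq \Pi_{2,1}$ such that for $\pi \in \mathcal{U}_\varepsilon$,

$$\dim_*\pi(\theta|_{A_\varepsilon}) > \min\{1,\dim\theta\} - \varepsilon.$$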

Proof. Let $P$ denote the distribution of the CP-chain, and write

$$\alpha = \min\{1,\dim\theta\}.$$

This is the expected dimension of the projection of a typical measure for a typical
ergodic component of the process. By Theorem 8.3 we can find a dense open set
$\mathcal{U}'_\varepsilon \subseteq \Pi_{2,1}$ and $A'_\varepsilon \subseteq \mathcal{P}(\mathbb{R}^2)$ such that $P(A'_\varepsilon) > 1 - \varepsilon$, and $\dim_*\pi\tau > \alpha - \varepsilon$ for all
$\pi \in \mathcal{U}'_\varepsilon$ and $\tau \in A'_\varepsilon$.

Using the fact that, conditioned on $w$, the distribution of $\tau = \tau_{x,y,w}$ is $S_w(\mu^x\times\nu^y)$,
with $(x,y) \sim \theta$, we can apply Fubini to find a $w_0 \in [0,\log 3)$ and a Borel set $E \subseteq X\times Y$
such that $\theta(E) > 1 - \varepsilon$ and for every $(x,y) \in E$,

$$\dim_*\pi(S_{w_0}(\mu^x\times\nu^y)) > \alpha - \varepsilon \quad\text{for all } \pi \in \mathcal{U}'_\varepsilon.$$

For an affine map $S$ we identify $\pi\circ S$ with the projection $\pi'$ whose pre-image sets
partition $\mathbb{R}^2$ into the same lines as $\pi\circ S$. Let

$$\mathcal{U}_\varepsilon = \{\pi\circ S_{w_0} : \pi \in \mathcal{U}'_\varepsilon\}.$$

Then $\mathcal{U}_\varepsilon$ is open and dense in $\Pi_{2,1}$ and for $(x,y) \in E$,

$$\dim_*\pi(\mu^x\times\nu^y) > \alpha - \varepsilon \quad\text{for all } \pi \in \mathcal{U}_\varepsilon.$$

Define $\eta \in \mathcal{P}([0,1]^2)$ by

$$\eta = \int_E \mu^x\times\nu^y\,d\theta(x,y).$$

Then by Lemma 3.1,

$$\dim_*\pi\eta > \alpha - \varepsilon \quad\text{for all } \pi \in \mathcal{U}_\varepsilon.$$

Since

$$\theta = \mu\times\nu = \int \mu^x\times\nu^y\,d\theta(x,y),$$

we have $\eta \ll \mu\times\nu$, and since $\theta(E) > 1 - \varepsilon$ there is a subset $A_\varepsilon \subseteq [0,1]^2$ such that
$\theta|_{A_\varepsilon} \sim \eta$ and $\theta(A_\varepsilon) > 1 - \varepsilon$. This is the desired set. □

Footnote. The dynamics of this process may also be described as follows. Let $\Sigma = X\times Y\times[0,\log 3)$ with the
measure $\mu\times\nu\times\lambda$. Let $T$ denote the shift on $X, Y$. Define a transformation $S : \Sigma \to \Sigma$ by

$$S(\omega,\omega',w) = (T\omega,\ T^{1_{\{w\ge\log 3-\log 2\}}}\omega',\ w + \log 2 \bmod \log 3)$$

where $T^{1_a} = T$ if the event $a$ occurs and is the identity otherwise. This represents the dynamics as
a skew product $X\times[0,\log 3)$ with fiber $Y$, in which the second coordinate $w$ is used to control the
average speed with which the third coordinate (i.e. $Y$) is advanced.

10.3. Proof of Theorem 1.3. Let $\mu, \nu$ be ergodic $T_2$- and $T_3$-invariant measures on $[0,1]$,
respectively, such that $\dim_*\mu, \dim_*\nu > 0$. Let $\theta = \mu\times\nu$. Write

$$\alpha = \min\{1,\dim\theta\},$$

and for $\varepsilon > 0$ let $V_\varepsilon \subseteq \Pi_{2,1}$ denote the set

$$V_\varepsilon = \big\{\pi \in \Pi_{2,1} : \text{there exists } A \subseteq [0,1]^2 \text{ with } \theta(A) > 1-\varepsilon \text{ and } \dim_*\pi(\theta|_A) > \alpha - \varepsilon\big\}.$$

In the previous section we saw that the interior of $V_\varepsilon$ is open and dense in $\Pi_{2,1}$.
We now establish some invariance. For this it is convenient to represent projections
$\pi \in \Pi_{2,1}$ by the slope of their image, i.e. $\pi_a$ projects onto the line $y = ax$ (we
cannot represent projection to the $y$-axis, but also do not want to). We note that this
identification of $V_\varepsilon$ as a subset of $\mathbb{R}$ is consistent with the topology on $\Pi_{2,1}$.

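Proposition 10.3. $V_\varepsilon$ is invariant under the action by multiplication of the semigroup
$S = \{3^i/2^j : i,j \in \mathbb{N}\} \subseteq (\mathbb{R}^+,\times)$, i.e. if $\pi_a \in V_\varepsilon$ then $\pi_{3a}, \pi_{a/2} \in V_\varepsilon$.

Proof. Suppose $\pi_a \in V_\varepsilon$ and let $A \subseteq [0,1]^2$ be a set of measure $> 1 - \varepsilon$ as in the
definition of $V_\varepsilon$.

Write $T_2\times\mathrm{id}$ for the map $[0,1]^2 \to [0,1]^2$ given by

$$(T_2\times\mathrm{id})(x,y) = (T_2x,y),$$

and let

$$A_0 = (T_2\times\mathrm{id})^{-1}A \cap ([0,\tfrac12)\times[0,1]),$$
$$A_1 = (T_2\times\mathrm{id})^{-1}A \cap ([\tfrac12,1)\times[0,1]),$$

so $(T_2\times\mathrm{id})^{-1}A = A_0\cup A_1$. By the invariance properties of $\mu, \nu$ we see that $\mu\times\nu$ is
invariant under $T_2\times\mathrm{id}$, so

$$\mu\times\nu(A_0\cup A_1) = \mu\times\nu((T_2\times\mathrm{id})^{-1}A) > 1 - \varepsilon.$$

Also, write

$$\mu_0 = \mu|_{[0,1/2)}, \qquad \mu_1 = \mu|_{[1/2,1]}.$$

Now, by invariance we have

(10.4) $\mu\times\nu = (T_2\mu_0\times\nu) + (T_2\mu_1\times\nu).$

It follows that there are affine maps $S_0, S_1$ of $\mathbb{R}$ with

$$\pi_a(\mu\times\nu|_A) = S_0\pi_{a/2}(\mu_0\times\nu|_{A_0}) + S_1\pi_{a/2}(\mu_1\times\nu|_{A_1}).$$

One way to see this is to note that the fibers of $\pi_{a/2}$ are lines of slope $-2/a$ and are
mapped under $(x,y) \mapsto (2x,y)$ to the fibers of $\pi_a$, which have slope $-1/a$.

By assumption $\dim_*\pi_a(\mu\times\nu|_A) > \alpha - \varepsilon$. Affine maps preserve dimension, so

$$\alpha - \varepsilon < \dim_*\pi_a(\mu\times\nu|_A)$$
$$= \min\{\dim_*S_0\pi_{a/2}(\mu_0\times\nu|_{A_0}),\ \dim_*S_1\pi_{a/2}(\mu_1\times\nu|_{A_1})\}$$
$$= \min\{\dim_*\pi_{a/2}(\mu_0\times\nu|_{A_0}),\ \dim_*\pi_{a/2}(\mu_1\times\nu|_{A_1})\}$$
$$= \dim_*\pi_{a/2}(\mu\times\nu|_{(T_2\times\mathrm{id})^{-1}A}),$$

whence $\pi_{a/2} \in V_\varepsilon$.

A similar analysis, using the measures $\nu_i = \nu|_{[i/3,(i+1)/3)}$ and the identity

$$\mu\times\nu = (\mathrm{id}\times T_3)(\mu\times\nu_0) + (\mathrm{id}\times T_3)(\mu\times\nu_1) + (\mathrm{id}\times T_3)(\mu\times\nu_2),$$

shows that if $\pi_a \in V_\varepsilon$, then $\pi_{3a} \in V_\varepsilon$. □

We can now conclude the proof of Theorem 1.3.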

Proof of Theorem 1.3. Using (10.2) for $\mu, \nu$ and Lemma 3.1 (2) applied to $\pi(\mu\times\nu)$, the
general case is reduced to the case in which $\mu$ and $\nu$ are ergodic. Furthermore, by
Corollary 3.9 it is enough to show that $\dim_*\pi(\mu\times\nu) \ge \alpha$.

Since $\log 3/\log 2 \notin \mathbb{Q}$, the semigroup $S = \{3^i/2^j : i,j \in \mathbb{N}\}$
is dense in $\mathbb{R}^+$; this is the only place in the proof where we use the arithmetic
properties of $m, n$. For each $\varepsilon > 0$ we have seen that the interior of $V_\varepsilon$ is open and
dense, in particular contains an open set in each of the rays $\mathbb{R}^+, \mathbb{R}^-$; and since it is
invariant under $S$ we see that

$$V_\varepsilon = \{\pi_a : a \ne 0\}.$$

For any $\pi_a$, $a \ne 0$ we have thus shown that there is a set $A_\varepsilon$ with $\mu\times\nu(A_\varepsilon) > 1 - \varepsilon$
and

$$\dim_*\pi_a(\mu\times\nu|_{A_\varepsilon}) > \alpha - \varepsilon,$$

which implies

$$\dim_*\pi(\mu\times\nu) \ge \alpha,$$

as desired. □
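The density of the semigroup $\{3^i/2^j\}$ used in the proof above can be observed numerically. The following Python sketch is a brute-force search and carries no weight in the argument; the function name `closest_semigroup_element`, the exponent bound `max_exp`, and the convention that the exponents start at 0 are assumptions made for the example.

```python
import math

def closest_semigroup_element(target, max_exp=100):
    """Search 3^i / 2^j with 0 <= i, j <= max_exp for the element closest
    to target on a logarithmic scale. Returns (i, j, value)."""
    if target <= 0:
        raise ValueError("target must be positive")
    log_t = math.log(target)
    best = None
    for i in range(max_exp + 1):
        for j in range(max_exp + 1):
            err = abs(i * math.log(3.0) - j * math.log(2.0) - log_t)
            if best is None or err < best[0]:
                best = (err, i, j)
    _, i, j = best
    return i, j, 3.0 ** i / 2.0 ** j

for t in (0.1, 1.7, 5.0, 42.0):
    i, j, value = closest_semigroup_element(t)
    print(t, (i, j), value)
```

Raising `max_exp` improves the approximation of every positive target, in line with the irrationality of $\log 3/\log 2$.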

10.4. Proof of the topological conjecture. We briefly show how to derive the
topological version, Conjecture 1.1, from the measure one above. Suppose that $X, Y \subseteq [0,1]$
are closed and invariant under $T_2, T_3$, respectively. Using the variational principle (see
e.g. [32, Theorem 14.1]) we can find probability measures $\mu, \nu$ on $[0,1)$, invariant and
ergodic under $T_2, T_3$, respectively, such that $\dim\mu = \dim X$ and $\dim\nu = \dim Y$.

Let $\theta = \mu\times\nu$ so that $\dim\theta = \dim X\times Y$. For any projection $\pi \in \Pi_{2,1}\setminus\{\pi_x,\pi_y\}$,
we have by the theorem above that

$$\dim\pi\theta = \min\{1,\dim\theta\}.$$

Since $\pi\theta$ is supported on $\pi(X\times Y)$, we have

$$\dim\pi(X\times Y) \ge \min\{1,\dim\theta\} = \min\{1,\dim(X\times Y)\}.$$

The right hand side is also an upper bound, so we are done.

10.5. The Rudolph-Johnson theorem. In this section we show how Theorem 1.3
implies the Rudolph-Johnson theorem. We first prove the theorem under the hypothesis
that the measure has positive dimension rather than positive entropy.

Theorem 10.4. Let $\mu$ be a probability measure on $[0,1]$ that is invariant under $T_m$
and $T_n$, with $m, n$ not powers of the same integer. Suppose that $\dim_*\mu > 0$. Then
$\mu$ = Lebesgue measure.

Proof. Write $\alpha = \dim_*\mu > 0$. Denote by $*$ convolution in $\mathbb{R}$ and by $\circ$ convolution in
$\mathbb{R}/\mathbb{Z}$. Notice that, up to an affine map, $\mu*\mu$ and $\pi_1(\mu\times\mu)$ are the same (recall that
$\pi_1$ is projection to the line $y = x$). Thus, by Theorem 1.3 with $\mu = \nu$,

$$\dim_*(\mu*\nu) = \dim_*\pi_1(\mu\times\nu) = \min\{1,2\alpha\}.$$

Also, since

$$\mu\circ\mu = \mu*\mu \bmod 1$$

and reduction modulo 1 is a countable-to-one local isometry,

$$\dim_*(\mu\circ\mu) = \min\{1,2\alpha\}.$$

Finally, it is easy to check that $\mu\circ\mu$ is again invariant under $T_m$ and $T_n$, because $T_m, T_n$
are endomorphisms of $\mathbb{R}/\mathbb{Z}$.

Iterating this argument, we see that for each $k$,

$$\dim_*\underbrace{\mu\circ\mu\circ\cdots\circ\mu}_{2^k\text{ times}} = \min\{1,2^k\alpha\},$$

and since $\alpha > 0$ there is a $k$ so that, for $\nu = \circ^{2^k}\mu$,

$$\dim_*\nu = 1.$$

But $\nu$ is a $T_m$-invariant measure, and by (10.2) Lebesgue measure is the only
$T_m$-invariant measure of lower dimension 1. Thus

$$\nu = \lambda.$$

To establish $\mu = \lambda$, we look at the Fourier coefficients, where convolution translates to
multiplication. For $i \ne 0$ we have

$$\hat\mu(i)^{2^k} = \hat\nu(i) = \hat\lambda(i) = 0.$$

Thus $\hat\mu(i) = 0$ for $i \ne 0$, so $\mu = \lambda$. □
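The doubling step in the proof above terminates after finitely many self-convolutions. As a small illustration (the function name `doublings_to_full_dimension` is hypothetical), the following Python sketch returns the smallest $k$ with $\min\{1, 2^k\alpha\} = 1$ for a given $\alpha > 0$.

```python
def doublings_to_full_dimension(alpha):
    """Smallest k >= 0 with min(1, 2^k * alpha) = 1, for alpha > 0."""
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    k = 0
    while (2 ** k) * alpha < 1.0:
        k += 1
    return k

print(doublings_to_full_dimension(0.3))    # dimensions 0.3, 0.6, then 1
print(doublings_to_full_dimension(0.01))
```

The loop terminates for every positive input, which is the only property of $\alpha$ that the proof uses.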

For ergodic measures positive dimension and positive entropy are equivalent con- 
ditions. However, the condition that all ergodic components have positive entropy is 
weaker; entropy behaves like the mean of the entropies of ergodic components, while 
dimension behaves like the essential infimum. Nevertheless, there is a simple reduction 
which recovers the non-ergodic case. 

Theorem 10.5 (Rudolph-Johnson [33, 18]). Let $\mu$ be a probability measure on $[0,1]$
that is invariant under $T_m$ and $T_n$, with $m, n$ not powers of the same integer. Suppose
that all ergodic components of $\mu$ have positive entropy. Then $\mu$ = Lebesgue measure.

Proof. Fix $t > 0$ and let $\mu_{\ge t}$ denote the restriction of $\mu$ to the $T_m$-ergodic components
which have entropy at least $t$ (and hence by (10.1) dimension at least $t/\log m$). Clearly
$T_m$ preserves $\mu_{\ge t}$. Also, since $T_n$ preserves $\mu$ and commutes with $T_m$, it acts on the ergodic
components of $(\mu,T_m)$, and if a $T_m$-ergodic component $\nu$ is mapped to $\nu'$, then $\nu'$ is an
extension of $\nu$, so $h(\nu',T_n) \ge h(\nu,T_n)$. Hence $T_n$ preserves the ergodic components of
$\mu_{\ge t}$, and thus it preserves $\mu_{\ge t}$. It follows that $\mu_{\ge t}$ is $T_m, T_n$-invariant, and its dimension
is

$$\dim_*\mu_{\ge t} = \operatorname{essinf}\frac{h(\nu,T_m)}{\log m} \ge \frac{t}{\log m} > 0$$

(the infimum is over $T_m$-ergodic components $\nu$ of $\mu_{\ge t}$). Hence by the previous theorem
$\mu_{\ge t}$ is Lebesgue measure. Since $\mu = \lim_{t\to 0}\mu_{\ge t}$, we are done. □

11. Convolutions of Gibbs measures

11.1. Preliminaries. Let $\mathcal{I} = \{f_i : i \in \Lambda\}$ be an iterated function system on the
interval $[0,1]$. The IFS $\mathcal{I}$ is called a regular IFS if the following conditions hold:

(1) Regularity: There is $\varepsilon > 0$ such that each $f_i$ is a $C^{1+\varepsilon}$ map on a neighborhood
of $[0,1]$.

(2) Contraction and orientation: $0 < Df_i(x) < 1$ for all $i$ and all $x \in [0,1]$.

(3) Separation: The sets $f_i((0,1))$ are pairwise disjoint subsets of $(0,1)$.

We say that a closed set $X \subseteq [0,1]$ is a regular Cantor set if it is the attractor of a
regular IFS. This definition is more restrictive than the one in e.g. [24]. Our methods
can handle the more general setting with slight modifications, but for simplicity we
concentrate on the case above.

We recall the definition and some basic facts about Gibbs measures; a clearly written
introduction to this topic can be found in [6, Chapter 5]. Let $\Lambda$ be a finite set. If
$\varphi : \Lambda^{\mathbb{N}} \to \mathbb{R}$ is a Hölder-continuous function, then there exist a unique real number
$P(\varphi)$ and a unique ergodic measure $\mu_\varphi$ on $\Lambda^{\mathbb{N}}$, such that

(11.1) $\mu_\varphi([x_1\ldots x_n]) = \Theta\Big(\exp\Big(-nP(\varphi) + \sum_{i=0}^{n-1}\varphi(T^ix)\Big)\Big)$

for all $x \in \Lambda^{\mathbb{N}}$, where $T$ is the shift on $\Lambda^{\mathbb{N}}$. The number $P(\varphi)$ is known as the topological
pressure of $\varphi$, and the measure $\mu_\varphi$ as the Gibbs measure for the potential $\varphi$.

We say that two measures defined on the same measure space are $C$-equivalent if they
are mutually absolutely continuous and both Radon-Nikodym derivatives are bounded
by $C$; this is denoted by $\sim_C$. If $x, y$ are numbers, we also write $x \sim_C y$ to denote that
$x \le Cy$ and $y \le Cx$.

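Given a cylinder set $[a]$ and a measure $\mu$ on $\Lambda^{\mathbb{N}}$, let $\mu^{[a]}$ denote the probability
measure on $\Lambda^{\mathbb{N}}$ given by

$\mu^{[a]}([b]) = \frac{\mu([ab])}{\mu([a])}$ for $b \in \Lambda^*$.

This is the symbolic analogue of rescaling a measure on $\mathbb{R}^d$.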

While Gibbs measures are not generally product measures, they do satisfy a slightly 
weaker property, which is the only one of their properties that we use: 

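Lemma 11.1. Let $\mu$ be a Gibbs measure on $\Lambda^{\mathbb{N}}$ for some Hölder potential $\varphi$. Then
there is $C = C(\varphi) > 0$ such that for any word $a \in \Lambda^*$, $\mu \sim_C \mu^{[a]}$.

Proof. It follows from (11.1) that

$\mu([ab]) = \Theta(\mu([a])\,\mu([b]))$ for all $a, b \in \Lambda^*$.

This shows that $\mu^{[a]}$ and $\mu$ are $C$-equivalent for some $C$ that depends on $\varphi$ only. □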

Motivated by the previous lemma, we say that a measure $\mu$ on $\Lambda^{\mathbb{N}}$ is a $C$-quasi-product
measure if $C > 0$ and $\mu^{[a]} \sim_C \mu$ for all $a \in \Lambda^*$ or, equivalently,

$C^{-1}\mu([ab]) \leq \mu([a])\,\mu([b]) \leq C\mu([ab])$.
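The quasi-product inequality can be checked by hand on small examples. The following is a minimal sketch, assuming a hypothetical stationary two-state Markov measure (the transition matrix `P`, the vector `PI` and both function names are illustrative and not taken from the text); it computes the smallest constant $C$ that works for all pairs of words up to a given length.

```python
from itertools import product

# Hypothetical two-state Markov chain, chosen only for illustration.
P = [[0.7, 0.3],
     [0.4, 0.6]]
# Stationary distribution of P, solved by hand: pi = (4/7, 3/7).
PI = [4.0 / 7.0, 3.0 / 7.0]


def cylinder(word):
    """Measure of the cylinder [word] under the stationary Markov measure."""
    m = PI[word[0]]
    for s, t in zip(word, word[1:]):
        m *= P[s][t]
    return m


def quasi_product_constant(max_len):
    """Smallest C with C^-1 mu[ab] <= mu[a] mu[b] <= C mu[ab]
    over all words a, b of length between 1 and max_len."""
    c = 1.0
    for la in range(1, max_len + 1):
        for lb in range(1, max_len + 1):
            for a in product((0, 1), repeat=la):
                for b in product((0, 1), repeat=lb):
                    ratio = cylinder(a + b) / (cylinder(a) * cylinder(b))
                    c = max(c, ratio, 1.0 / ratio)
    return c
```

For this chain the ratio $\mu([ab])/(\mu([a])\mu([b]))$ depends only on the last letter of $a$ and the first letter of $b$, so the constant does not grow with the word length; a Bernoulli measure would give $C = 1$.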

A few comments about this notion are in order. It is easy to see that the support
of a quasi-product measure on $\Lambda^{\mathbb{N}}$ is $\Xi^{\mathbb{N}}$ for some subset $\Xi \subseteq \Lambda$. Thus by
replacing $\Lambda$ by $\Xi$ we can always assume that quasi-product measures are globally
supported. Each $T^n\mu$ is $C$-equivalent to $\mu$, and taking any weak-* limit point $\nu$ of
$\frac{1}{n}(\mu + T\mu + T^2\mu + \dots + T^{n-1}\mu)$, as $n \to \infty$, we obtain a $T$-invariant measure equivalent
to $\mu$ and with Radon-Nikodym derivative bounded between $C^{-1}$ and $C$. Also, it
is easy to show that $\mu$ has a rudimentary mixing property: there is a constant $K$ such
that if $A, B$ are sets then $\mu(T^{-n}A \cap B) > K\mu(A)\mu(B)$ for large enough $n$. It follows
that the same is true of $\nu$, perhaps with a different constant, and so $\nu$ is ergodic. It
follows now that $\nu$, and hence $\mu$, are exact dimensional.

Finally, we transfer these notions to the geometric attractor: A Gibbs measure on $X$
is the projection of a Gibbs measure on $\Lambda^{\mathbb{N}}$ under the coding map. In the same way we
define quasi-product measures on $X$. Thus, Gibbs measures on $X$ are quasi-product
measures.

We have the following slight strengthening of Theorem 1.4:

Theorem 11.2. Theorem 1.4 holds for quasi-product measures on the attractors
$X_i$.



For notational simplicity we prove Theorem 11.2 for $d = 2$; the proof is the same in
higher dimensions. The structure of the proof resembles that of Theorem 1.6 but is
technically more difficult. The main differences are, first, that micromeasures of Gibbs
measures on regular IFSs are harder to relate to the original measure; and, second,
that moving the open set around $\Pi_{2,1}$ is more involved due to nonlinearity of the IFSs.
In the next section we present some standard tools for dealing with these problems.

Before embarking on the proof we explain its relation to Moreira's proof of (1.3)
in the strictly non-linear case. The key device there was the so-called Scale Recurrence
Lemma (SRL) of [24]. This technical result, which relies on the non-linearity of the
system, gives information about the quotients

$\frac{\operatorname{diam}(f^{(1)}_a([0,1]))}{\operatorname{diam}(f^{(2)}_b([0,1]))}$

for many (but not all) pairs $a \in \Lambda_1^*$, $b \in \Lambda_2^*$. In the proof one uses only the pairs of
words that are "good" in the sense of the SRL in order to construct subsets $X_i' \subseteq X_i$
which approximate $X_i$ well in dimension, but have zero measure with respect to any
Gibbs measure. Hence it seems unlikely that Moreira's proof can be adapted to give
information about convolutions of Gibbs measures. Our proof uses the machinery of
CP processes in place of the SRL, specifically in the proof of Proposition 11.8. After
that our proof follows Moreira's original argument.



11.2. Limit geometries and micromeasures. Throughout this sub-section, $\mathcal{I} =
\{f_i : i \in \Lambda\}$ is a regular IFS with attractor $X$, and $\mu$ is a $C$-quasi-product measure
on $X$. By definition, $\mu = \Phi\eta$ where $\Phi$ is the coding map and $\eta$ is a $C$-quasi-product
measure on $\Lambda^{\mathbb{N}}$.

Given a closed interval $[a, b]$, denote by $T_{[a,b]}$ the unique orientation-preserving affine
map sending $[a, b]$ to [0, 1]. If $f : [0, 1] \to \mathbb{R}$ is a continuous injection, write

$T_f = T_{f([0,1])}$

and

$f^* = T_f f$.

Hence $f^*$ is a bijection of [0, 1]. Also, for $a \in \Lambda^*$ write

$I(a) = f_a([0, 1])$.

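The following simple lemma is a consequence of the separation assumption on the IFS
and $C$-equivalence.

Lemma 11.3. For every $a \in \Lambda^*$,

$f_a\mu \sim_C \mu_{I(a)}$.

Proof. For any $b \in \Lambda^*$,

$f_a\mu(I(ab)) = \mu(I(b)) = \eta([b]) \sim_C \frac{\eta([ab])}{\eta([a])} = \mu_{I(a)}(I(ab))$.

Since $\{I(ab)\}$ is a basis of closed sets of $X \cap I(a)$, this yields the lemma. □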



The following result goes back to Sullivan [34], with an explicit proof given by Bedford
and Fisher [2]. It will be a key tool in the proof, in particular allowing us to
describe the micromeasures of a Gibbs measure.

Theorem 11.4. For every left-infinite sequence $x = (\dots x_{-2}x_{-1})$, the sequence of
rescaled diffeomorphisms $(f_{x_{-n}} \cdots f_{x_{-1}})^*$ converges in the $C^1$ topology to a diffeomorphism
$F_x : [0, 1] \to [0, 1]$, uniformly in $x$. In particular, the map $x \mapsto F_x$ is continuous.

In this case, the sets $F_x(X)$ are known as limit geometries of $X$. We also refer to
the maps $F_x$ themselves as limit diffeomorphisms.

Heuristically, the reason for the validity of Sullivan's Theorem is the following: when
composing $f_{x_{-n}} \circ \cdots \circ f_{x_{-1}}$, the strongest nonlinear distortion comes from the first map
applied, $f_{x_{-1}}$. Since each map is a contraction, $f_{x_{-i}}$ is applied to an interval of length
exponentially small in $i$. As $i$ gets large, the nonlinear effect of $f_{x_{-i}}$ becomes negligible
(the details of this argument require $C^{1+\varepsilon}$-regularity of the contractions).

Note that when the contractions are similarities the maps $F_x$ are all the identity,
and Sullivan's Theorem becomes trivial.
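The heuristic above can be observed numerically. The following is a minimal sketch, assuming a hypothetical regular IFS made of two quadratic contractions (the maps `f0`, `f1`, the sequence `SEQ` and the helper names are illustrative choices, not objects from the text); it evaluates the rescaled compositions and measures how much they change as more maps are composed.

```python
# Hypothetical regular IFS: both maps are increasing, have derivative in (0, 1)
# on [0, 1], and send (0, 1) onto the disjoint intervals (0, 0.325), (0.6, 0.9).
def f0(x):
    return (x + 0.3 * x * x) / 4.0


def f1(x):
    return 0.6 + (x + 0.2 * x * x) / 4.0


MAPS = (f0, f1)


def rescaled(word, x):
    """Value at x of the rescaled composition (f_{w[-1]} o ... o f_{w[0]})^*.

    word[0] is the first map applied; the image interval is mapped back
    affinely onto [0, 1]."""
    lo, hi, y = 0.0, 1.0, x
    for i in word:
        f = MAPS[i]
        lo, hi, y = f(lo), f(hi), f(y)
    return (y - lo) / (hi - lo)


def sup_gap(word_short, word_long, grid=200):
    """Sup-distance on a grid between two rescaled compositions."""
    return max(abs(rescaled(word_short, k / grid) - rescaled(word_long, k / grid))
               for k in range(grid + 1))


# Truncation (x_{-1}, x_{-2}, ...) of a left-infinite sequence: first entry applied first.
SEQ = (0, 1, 1, 0, 1, 0, 0, 1, 1, 0, 1, 0)
gap_4_8 = sup_gap(SEQ[:4], SEQ[:8])
gap_8_12 = sup_gap(SEQ[:8], SEQ[:12])
```

In this example `gap_8_12` is much smaller than `gap_4_8`, in line with the exponential decay suggested by the heuristic, while the limiting map is visibly not the identity because the first map applied is nonlinear.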

If $x \in \Lambda^{-\mathbb{N}}$, one may consider the conjugated IFS

$\mathcal{I}_x = \{F_x f_i F_x^{-1} : i \in \Lambda\}$,

whose attractor is the limit geometry $F_x(X)$. More generally, if $\Phi_x$ is the coding map
for $\mathcal{I}_x$, then $\Phi_x = F_x\Phi$.

Corollary 11.5. (i) Choose $a \in \Lambda^*$, and write $\bar{a} = (\dots aa) \in \Lambda^{-\mathbb{N}}$. Then $F_{\bar{a}} f_a F_{\bar{a}}^{-1}$
is an affine map with derivative $\lambda(f_a)$.
(ii) Let $g \in C^{1+\varepsilon}([0, 1], \mathbb{R})$, and consider the conjugated IFS $\mathcal{I}' = \{g f_i g^{-1} : i \in \Lambda\}$.
Then for any $x \in \Lambda^{-\mathbb{N}}$, $\mathcal{I}_x = \mathcal{I}'_x$. In particular, this holds when $g$ is itself a
limit diffeomorphism $F_y$.

Proof. (i) Write $T_n = T_{f_{a^n}}$, where $a^n$ is the $n$-fold concatenation of $a$. We have

$F_{\bar{a}} f_a F_{\bar{a}}^{-1} = \lim_{n} T_n f_{a^n} f_a F_{\bar{a}}^{-1}$

$= \lim_{n} (T_n T_{n+1}^{-1})\, T_{n+1} f_{a^{n+1}} F_{\bar{a}}^{-1}$.

Since $T_{n+1} f_{a^{n+1}} \to F_{\bar{a}}$, the last factor tends to the identity, so the limit equals
$\lim_n T_n T_{n+1}^{-1}$, which is affine as a limit of affine maps. The last assertion follows since $\lambda(\cdot)$ is
conjugacy-invariant.

(ii) Limit diffeomorphisms are clearly invariant under affine changes of coordinates
and, being an infinitesimal property of the IFS, therefore also under smooth changes
of coordinates. □

Next, we state a version of the well-known principle of bounded distortion. For the
rest of the section, the constants implicit in the $O(\cdot)$ notation depend only on the IFSs
involved.

Proposition 11.6. Let $\mathcal{I} = \{f_i : i \in \Lambda\}$ be a regular IFS. Then for any finite word
$a \in \Lambda^*$ and any $i, j \in \Lambda$,

$\operatorname{dist}(I(ai), I(aj)) = \Theta(|I(a)|)$,

and

$|I(ai)| = \Theta(|I(a)|)$.
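Bounded distortion is easy to observe on a concrete system. The following is a minimal sketch, assuming a hypothetical two-map regular IFS whose first-level intervals are separated by a gap (the maps `f0`, `f1` and the helper names are illustrative and not from the text); it records the ratios $|I(ai)|/|I(a)|$ and $\operatorname{dist}(I(a0), I(a1))/|I(a)|$ over all words up to a given length.

```python
from itertools import product


# Hypothetical regular IFS with images [0, 0.325] and [0.6, 0.9] of [0, 1].
def f0(x):
    return (x + 0.3 * x * x) / 4.0


def f1(x):
    return 0.6 + (x + 0.2 * x * x) / 4.0


MAPS = (f0, f1)


def interval(word):
    """Endpoints of I(word) = f_{w_1} o ... o f_{w_n}([0, 1])."""
    lo, hi = 0.0, 1.0
    for i in reversed(word):  # the last letter is the innermost map
        lo, hi = MAPS[i](lo), MAPS[i](hi)
    return lo, hi


def distortion_ratios(max_len):
    """Extreme values of |I(ai)|/|I(a)| and dist(I(a0), I(a1))/|I(a)|."""
    child, gap = [], []
    for n in range(max_len + 1):
        for a in product((0, 1), repeat=n):
            lo, hi = interval(a)
            l0, h0 = interval(a + (0,))
            l1, h1 = interval(a + (1,))
            child += [(h0 - l0) / (hi - lo), (h1 - l1) / (hi - lo)]
            gap.append((l1 - h0) / (hi - lo))  # maps are increasing, so I(a0) lies left of I(a1)
    return min(child), max(child), min(gap), max(gap)
```

Calling `distortion_ratios(n)` for increasing `n` shows the four extreme values staying within fixed positive bounds, which is the content of the $\Theta(\cdot)$ statements for this particular example.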

The next step is to relate $\mu$ to its micromeasures. Recall that the family of micromeasures
of a measure $\eta$ on $\mathbb{R}^d$ is denoted $\langle\eta\rangle$. The following theorem is analogous
to Proposition 9.1.

Proposition 11.7. For any $\nu \in \langle\mu\rangle$ which is not supported on $\{0, 1\}$, there is a limit
diffeomorphism $F$ and an interval $J$ such that

$F\mu \sim_C \nu^J$.

Proof. By definition, there are intervals $I_n \subseteq [0, 1]$ such that

$\nu = \lim_{n \to \infty} \mu^{I_n}$,

where $\mu^I = T_I(\mu_I)$ is the rescaling of $\mu_I$ back to the unit interval. For each $n$ let $a^{(n)}$
be a minimal word such that $I(a^{(n)}) \subseteq I_n$ (there could be several such words; pick
one of them). Since $\nu$ is not supported on $\{0, 1\}$, this is well-defined for large $n$, and
moreover $|I_n| = \Theta(|I(a^{(n)})|)$ by bounded distortion (i.e. Proposition 11.6).
By Lemma 11.3, writing $f_n = f_{a^{(n)}}$,

$f_n^*\mu \sim_C \mu^{I(a^{(n)})}$.

On the other hand, for each $n$ there is an interval $J_n$ (corresponding to the relative
position of $I(a^{(n)})$ inside $I_n$) such that $|J_n| = \Theta(1)$, and

$\mu^{I(a^{(n)})} = (\mu^{I_n})^{J_n}$.

By passing to a subsequence we can assume that $J_n$ converges to an interval $J$, and
$f_n^*$ to a limit diffeomorphism $F$. Taking weak limits yields the proposition. □

11.3. Proof of Theorem 11.2. Throughout this section, $\mu_1, \mu_2$ are quasi-product
measures associated to regular IFSs $\mathcal{I}_i = \{f^{(i)}_j : j \in \Lambda_i\}$, $i = 1, 2$. We write $\mu = \mu_1 \times \mu_2$
and

$\gamma = \min(1, \dim\mu)$.

We say that $F : [0, 1]^2 \to [0, 1]^2$ is a limit diffeomorphism if $F = F_1 \times F_2$ where $F_i$ is a
limit diffeomorphism for $\mathcal{I}_i$.



We next state the three main steps in the proof of Theorem 11.2. The first one is,
as usual, a topological version of the projection theorem, which does not require any
minimality assumptions; compare with Proposition 9.3. Given a measure $\nu$ on $\mathbb{R}^d$ and
$\alpha > 0$, let

(11.2) $\mathcal{U}_\alpha(\nu) = \operatorname{interior}\{g \in C^1(\operatorname{supp}\nu, \mathbb{R}^k) : \dim_* g\nu > \alpha\}$.

Note that these sets are by definition open, though a priori they may be empty.

Proposition 11.8. For every $\varepsilon > 0$, there exists a limit diffeomorphism $F$ such that
$\mathcal{U}_{\gamma-\varepsilon}(F\mu)$ contains a dense (and automatically open) subset of $\Pi_{2,1}$.

Identify $\Pi_{2,1} \setminus \{\pi_x, \pi_y\}$ with $\mathbb{R} \setminus \{0\}$ via $s \mapsto \pi_s$, where $\pi_s(x, y) = x + sy$ (the latter is not
technically an element of $\Pi_{2,1}$ but there is an obvious identification). This is similar
to, but not the same as, the identification in Section 10. Given $K \gg 1$, write

$I_K = [-K, -K^{-1}] \cup [K^{-1}, K]$.

Using the irreducibility assumption to move the open and dense set around as usual
(but with some extra technical complications), one obtains:

Proposition 11.9. If additionally the minimality condition in Theorem 1.4 holds, then
for every $\varepsilon > 0$ and every $K > 1$ there is a limit diffeomorphism $F$ such that

$\{\pi_s : s \in I_K\} \subseteq \mathcal{U}_{\gamma-\varepsilon}(F\mu)$.

A priori, this holds for just one limit diffeomorphism $F = F_1 \times F_2$. But this automatically
implies that it holds in fact for all limit diffeomorphisms:

Proposition 11.10. Under the hypotheses of Proposition 11.9, for every $\varepsilon > 0$, every
$K > 1$, and every limit diffeomorphism $F$,

$\{\pi_s : s \in I_K\} \subseteq \mathcal{U}_{\gamma-\varepsilon}(F\mu)$.

Before proving these propositions we use them to deduce Theorem 11.2.
We say that a map $A : \mathbb{R}^2 \to \mathbb{R}^2$ is affine diagonal if it is affine with a diagonal
linear part. In this case, we can write $A = HD$, where $H$ is a homothety and
$D(x, y) = (x, ay)$ for some $a \in \mathbb{R}$. We refer to $a$ as the eccentricity of $A$.

Lemma 11.11. Let $\nu, \tau$ be two measures on $\mathbb{R}^2$, and suppose that $A\nu \sim_C \tau_Q$ for some
affine diagonal map $A$ and some set $Q$ with $\tau(Q) > 0$. Then for any $\alpha$,

$\mathcal{U}_\alpha(\nu) \supseteq \{gA : g \in \mathcal{U}_\alpha(\tau)\}$,

$\mathcal{U}_\alpha(\nu) \cap \Pi_{2,1} \supseteq \{\pi_{sa} : \pi_s \in \mathcal{U}_\alpha(\tau)\}$,

where $a$ is the eccentricity of $A$.

Proof. The first part is immediate. For the second, note that if $D(x, y) = (x, ay)$
then $\pi_s D = \pi_{as}$, and note that homotheties commute with linear maps and preserve
dimension. □
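The identity used in the second part can be written out explicitly. The computation below is a sketch in which the homothety is parametrized as $H(u, v) = r(u, v) + (c_1, c_2)$ with $r \neq 0$; the symbols $r$, $c_1$, $c_2$, $c$ are introduced here only for illustration.

```latex
\begin{align*}
\pi_s\bigl(D(x,y)\bigr) &= \pi_s(x, ay) = x + s\,(ay) = x + (as)\,y = \pi_{as}(x,y),\\
\pi_s\bigl(HD(x,y)\bigr) &= (r x + c_1) + s\,(r a y + c_2)
   = r\,\pi_{as}(x,y) + c, \qquad c = c_1 + s\,c_2 .
\end{align*}
```

Thus $\pi_s A\nu$ is the image of $\pi_{as}\nu$ under the similarity $u \mapsto ru + c$ of the line, and similarities do not change dimension.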

Proof of Theorem 11.2 (assuming Proposition 11.10). By Corollary 3.3 it is enough to
show that

$\dim_*(\pi(\mu_1 \times \mu_2)) \geq \gamma$.

Fix $\varepsilon > 0$ and $K > 1$ for the rest of the proof. For $i = 1, 2$ let $x^{(i)} \in \Lambda_i^{-\mathbb{N}}$, and write
$F_i = F_{x^{(i)}}$, $F = F_1 \times F_2$. Since, by Proposition 11.10,

(11.3) $\{\pi_s : s \in I_K\} \subseteq \mathcal{U}_{\gamma-\varepsilon}(F\mu)$,

we have by compactness of $I_K$ and openness of $\mathcal{U}_{\gamma-\varepsilon}(F\mu)$ that the same is true if one
replaces $F$ by a sufficiently close diffeomorphism. Since the convergence of $f^*_{y_{-n} \dots y_{-1}}$ to
$F_y$ is uniform, it follows that for sufficiently long initial segments $a_1, a_2$ of $x^{(1)}, x^{(2)}$
(where the threshold length is also independent of $x^{(i)}$), if we write $f^* = f^*_{a_1} \times f^*_{a_2}$,
then (11.3) holds with $f^*\mu$ instead of $F\mu$.

By the principle of bounded distortion, we can choose the lengths of $a_i$ such that
$I(a_1), I(a_2)$ have lengths which differ by a factor of $\Theta(1)$, and still ensure that the length
of both is bounded above independently of other parameters. Consider now the affine
diagonal map $A = T_{I(a_1)} \times T_{I(a_2)}$, i.e. $A$ is such that $f^* = Af$, where $f = f_{a_1} \times f_{a_2}$.
By construction the eccentricity of $A$ is $\Theta(1)$. Since $A(f\mu) = f^*\mu$ we get from Lemma
11.11 that

$\{\pi_s : s \in I_{K/O(1)}\} \subseteq \mathcal{U}_{\gamma-\varepsilon}(f\mu)$.

Recall that $f\mu \sim_C \mu_{I(a_1) \times I(a_2)}$ by the quasi-product property. We may cover $\operatorname{supp}(\mu)$
by rectangles of the form $I(a_1) \times I(a_2)$, with $a_1, a_2$ chosen to be long enough for the
above to hold. Thus, since $\varepsilon$ and $K$ were arbitrary, the proof is complete. □

11.4. Proofs of the remaining propositions.

Proof of Proposition 11.8. The proof is analogous to the proof of Proposition 9.3. We
apply Theorem 7.10 to obtain an ergodic CP-chain $(\mu_n)_{n=1}^{\infty}$ such that almost surely
$\mu_1 \in \langle\mu\rangle$ and $\dim\mu_1 = \dim\mu$. Hence, we can find a micromeasure $\nu$ such that $\dim\nu \geq
\dim\mu$, and the conclusion of Corollary 8.5 applies to $\nu$: given $\varepsilon > 0$, the family $\mathcal{U}_{\gamma-\varepsilon}(\nu)$
has dense intersection with $\Pi_{2,1}$.

By compactness, $\nu = \nu_1 \times \nu_2$, where $\nu_i \in \langle\mu_i\rangle$. By Proposition 11.7 we can find
a limit diffeomorphism $F$, an affine diagonal map $A$, and a rectangle $Q$, such that
$AF\mu \sim_C \nu_Q$. The proposition then follows from Lemma 11.11. □

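For the proof of Proposition 11.9 we need the following lemma:

Lemma 11.12. Suppose $U \subseteq \mathbb{R} \setminus \{0\}$, and

$\{\pi_s : s \in U\} \subseteq \mathcal{U}_\alpha(\mu)$.

Then for any limit diffeomorphism $F$ there is a $t = t(F) = \Theta(1)$ such that

$\{\pi_{ts} : s \in U\} \subseteq \mathcal{U}_\alpha(F\mu)$.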
Proof. This is very similar to the proof of Theorem 11.2 above: one uses the fact
that, up to affine rescaling and $C$-equivalence, the limit measure $F\mu$ is $C^1$-close to the
restriction of $\mu$ to a small rectangle $Q$ of eccentricity $\Theta(1)$. Since $\mathcal{U}_\alpha(\mu_Q) \supseteq \mathcal{U}_\alpha(\mu)$,
and the rescaling induced on $\Pi_{2,1}$ by pre-composition with $T_Q$ transforms $\pi_s$ to $\pi_{ts}$,
where $t$ is the eccentricity of $Q$, the lemma follows by applying this procedure to a
sequence of rectangles $Q_n$ of side length tending to 0 and eccentricities converging to
some $t = \Theta(1)$. □

Proof of Proposition 11.9. To begin, note that a priori we cannot "move around" the
dense set given by Proposition 11.8 as done in e.g. Proposition 9.4, since for this we
need the action of an affine map, and the IFSs involved are a priori nonlinear. Instead
we linearize the relevant map in each IFS by passing to a new limit geometry.

We first present the proof in the case $\log\lambda(f^{(1)}_{a_1})/\log\lambda(f^{(2)}_{a_2}) \notin \mathbb{Q}$ for some $a_i \in \Lambda_i^*$. By
iterating the IFSs and relabeling we may then assume that $\log\lambda(f^{(1)}_1)/\log\lambda(f^{(2)}_1) \notin \mathbb{Q}$.

Let $F = F_1 \times F_2$ be the limit diffeomorphism given by Proposition 11.8. The
conjugated IFS $\mathcal{I}'_i = \{F_i f^{(i)}_j F_i^{-1} : j \in \Lambda_i\}$ satisfies the same hypotheses as the original
one. Let $x = (\dots, 1, 1) \in \Lambda_i^{-\mathbb{N}}$. By Corollary 11.5(i), the first map in each limit
IFS $(\mathcal{I}_i)_x$ is affine. But by Corollary 11.5(ii), $(\mathcal{I}_i)_x = (\mathcal{I}'_i)_x$. On the other hand,
by Lemma 11.12 the conclusion of Proposition 11.8 is inherited by $(\mathcal{I}'_i)_x$. We have
therefore shown that there is no loss of generality in assuming that the conjugated
IFSs $\{g^{(i)}_j = F_i f^{(i)}_j F_i^{-1} : j \in \Lambda_i\}$ are such that $g^{(1)}_1, g^{(2)}_1$ are affine. Moreover, we still
have

(11.4) $\log\lambda(g^{(1)}_1)/\log\lambda(g^{(2)}_1) \notin \mathbb{Q}$.

Now we are ready to cover the whole of $I_K$ by using the action of $g^{(1)}_1 \times g^{(2)}_1$ on $\mathcal{U}_{\gamma-\varepsilon}(F\mu)$.
By (11.4), the collection of eccentricities of the (affine diagonal) maps

$g_{n_1,n_2} = (g^{(1)}_1)^{n_1} \times (g^{(2)}_1)^{n_2}$

is dense in $(0, \infty)$. Also note that

$g_{n_1,n_2}(F\mu) \sim_C (F\mu)_{Q_{n_1,n_2}}$,

where $Q_{n_1,n_2} = g_{n_1,n_2}([0, 1]^2)$. It then follows from Lemma 11.11 that there is $N$ such
that

$\{\pi_s : s \in I_K\} \subseteq \bigcup_{n_1,n_2=1}^{N} \mathcal{U}_{\gamma-\varepsilon}((F\mu)_{Q_{n_1,n_2}})$.

Pick $N_1, N_2 > N$ such that the eccentricity of $g_{N_1,N_2}$ is less than, say, 2. By the above,
$\mathcal{U}_{\gamma-\varepsilon}((F\mu)_{Q_{N_1,N_2}})$ contains $\{\pi_s : s \in I_K\}$. But then using Lemma 11.11 again we
conclude that $\mathcal{U}_{\gamma-\varepsilon}(F\mu)$ contains $\{\pi_s : s \in I_{K/2}\}$, as desired.
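The density of eccentricities used above is the classical fact that the numbers $\lambda_2^{n_2}/\lambda_1^{n_1}$ are dense in $(0, \infty)$ when $\log\lambda_1/\log\lambda_2$ is irrational. The following is a minimal numerical sketch, assuming two hypothetical affine contractions with ratios 1/3 and 1/2 (these values and the function name are illustrative, not from the text); it searches for exponents whose eccentricity approximates a prescribed target.

```python
import math

# Hypothetical contraction ratios; log(1/3)/log(1/2) is irrational.
LAM1, LAM2 = 1.0 / 3.0, 1.0 / 2.0


def best_eccentricity(target, n_max):
    """Best approximation of target by LAM2**n2 / LAM1**n1 with 1 <= n1 <= n_max, n2 >= 1.

    Returns (error in log scale, n1, n2, eccentricity), or None if no pair is admissible."""
    best = None
    for n1 in range(1, n_max + 1):
        # Solve LAM2**n2 / LAM1**n1 = target for real n2, then round to an integer.
        n2 = round((math.log(target) + n1 * math.log(LAM1)) / math.log(LAM2))
        if n2 < 1:
            continue
        ecc = LAM2 ** n2 / LAM1 ** n1
        err = abs(math.log(ecc / target))
        if best is None or err < best[0]:
            best = (err, n1, n2, ecc)
    return best
```

Increasing `n_max` makes the returned error smaller for any fixed positive target, which is the density statement in this toy setting; with commensurable logarithms (for example ratios 1/2 and 1/4) the attainable eccentricities would instead form a discrete set.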

In the general case we can still, by minimality and Proposition 11.8, find $a_i \in \Lambda_i^*$,
$i = 1, 2$, and an $N$ such that

N 

fg') « : I, € M,-,(Fm) ^ 



ni,n2=l 



By Lemma 11.12 the same holds after passing to the limit IFSs which linearize the
maps $f^{(i)}_{a_i}$, at the cost of replacing $I_K$ by $I_{\Omega(K)}$. The argument then proceeds in the
same manner as before. □



Proof of Proposition 11.10. By Lemma 11.12, if $\mathcal{U}_\alpha(\mu)$ contains $\{\pi_s : s \in I_K\}$, then
the same is true with $K$ replaced by $\Omega(K)$ and $\mu$ replaced by $F\mu$ for any limit
diffeomorphism $F$. We know that $\mathcal{U}_\alpha(G\mu)$ contains $\{\pi_s : s \in I_K\}$ for some limit diffeomorphism $G$.
Thus Corollary 11.5(ii) and the previous observation applied to the conjugated IFSs
$\{G_i f^{(i)}_j G_i^{-1} : j \in \Lambda_i\}$ (i.e. to the measure $G\mu$) yield the result. □